1 Introduction

Small, nanometre-sized beams are necessary to reach the required luminosity at future linear colliders. Together with the high energy, the small beams cause large electromagnetic fields during the bunch crossing. These intense fields at the interaction point squeeze the beams. This so-called pinch effect increases the instantaneous luminosity. However, the deflection of the particles also leads to the emission of Beamstrahlung photons—which reduce the nominal energy of colliding particles—and produces collisions below the nominal centre-of-mass energy [14]. The resulting spectrum of centre-of-mass energies is traditionally called the luminosity spectrum Footnote 1 [47].

The knowledge of the shape of this luminosity spectrum is mandatory for the precision measurements in which a cross-section has to be known. While the cross-section depends on the centre-of-mass energy, the observables measured in the lab frame also depend on the difference in energy of the colliding electrons,Footnote 2 which determines the Lorentz boost of the system.

Unlike the electron structure functions [i.e, Initial State Radiation (ISR)]—which can be calculated precisely—the beam-beam forces, and therefore the Beamstrahlung, highly depend on the geometry of the colliding bunches. The actual beam-beam interaction taking place at the interaction point cannot be precisely simulated, because the geometry of the bunches cannot be measured. Therefore, the luminosity spectrum at the interaction point has to be measured using a physics channel with well known properties, e.g., Bhabha scattering.

The observables measured in the events are affected by detector resolutions. The distributions used for the reconstruction of the luminosity spectrum are also dependent on the cross-section of the process, and initial and final state radiation (FSR). All effects have to be taken into account for the reconstruction of the luminosity spectrum.

It was pointed out by Frary and Miller [5] that a precise reconstruction of the peak of the luminosity spectrum, necessary for a top-quark threshold scan, can only be achieved with a measurement of the angles of the outgoing electrons from Bhabha scattering. The angles of the two particles are the most precisely measurable observable [5]. The angles of the outgoing electrons—or rather the acollinearity between the two particles—are sufficient to extract a relative centre-of-mass energy, which gives access to the luminosity spectrum.

Toomi et al. [6] showed that the reconstruction of a parameterised luminosity spectrum is possible using a template fit. Their parameterisation only used three parameters to describe the effective centre-of-mass energy spectrum. However, as the boost of the initial system and correlation between the energies of the two particles cannot be neglected [8], a description of the energies of the pairs of colliding particles is necessary. Correlations exist between the two particle energies because the probability to emit beamstrahlung depends on the distance travelled in the field of the opposite bunch, and the field strength depends on the position inside the bunch. As two particles can only collide, when they are in similar position in their respective bunches the energy between two particles is correlated.

The relative centre-of-mass energy, that is reconstructed from the acollinearity, is equal to unity for back-to-back particles, and always smaller than unity for larger acollinearity, regardless whether one of the particles has a higher or lower energy than nominal. Therefore, Shibata et al. [7] proposed to calculate the distribution of the four-vectors of the Bhabha electrons and extract the luminosity spectrum with the iterative Expectation–Maximisation algorithm. They have, however, considered neither detector resolutions, nor Initial and Final State Radiation. For a full description of the outgoing Bhabha electrons, the luminosity spectrum would have to be weighted with the Bhabha cross-section and convoluted with the detector resolutions, which would require a huge computational effort, when using their method.

A reconstruction of the energy of the particle pairs was done for the 500 GeV ILC [9]. The acollinearity and the energies of the electrons measured in the calorimeter were used in a reweighting fit to reconstruct the luminosity spectrum. The parameterisation—necessary for the reweighting fit—accounted for the correlation between the two beams and the beam-energy spread.

This paper follows the approach of the 500 GeV ILC study [9], extends it, and applies it to the luminosity spectrum of the 3 TeV CLIC [10], which is the most challenging luminosity spectrum. This paper is structured as follows: in Sect. 2 the basic and cross-section scaled luminosity spectrum are defined. The Bhabha scattering and observables used for the reconstruction are also introduced. In Sect. 3 the model of the luminosity spectrum, required to perform a reweighting fit, is constructed. The reweighting technique is explained in Sect. 4, and in Sect. 5 it is applied to first validate the model against the luminosity spectrum at the 3 TeV CLIC; then all the relevant effects leading to the measured observables are included, and the luminosity spectrum is reconstructed from these distributions. In Sect. 6 the impact of the reconstructed luminosity spectrum on the measurement of the masses of supersymmetric particles in a CLIC benchmark process is estimated. The paper closes with a summary, conclusions, and outlook in Sect. 7.

2 Luminosity spectrum, Bhabha scattering, and the measurement

The nominal centre-of-mass energy \(\sqrt{{s_{\mathrm {nom}}}}\) of a collider with two beams with the nominal beam energy \(E_{\mathrm {Beam}}\) is given by \(\sqrt{{s_{\mathrm {nom}}}} = 2E_{\mathrm {Beam}} \). If the two interacting particles carry only a fraction of the nominal beam energy \(x_{1,2} =E_{1,2}/E_{\mathrm {Beam}} \), the effective centre-of-mass energy is

$$\begin{aligned} \sqrt{s^{\scriptscriptstyle \prime }} = 2 E_{\mathrm {Beam}} \sqrt{{x_{1} x_{2}}}. \end{aligned}$$
(1)

The basic luminosity spectrum \(\fancyscript{L}\bigl (x \bigr )\) describes either the distribution of the fraction of centre-of-mass energies \(x =\sqrt{s^{\scriptscriptstyle \prime }}/\sqrt{{s_{\mathrm {nom}}}} \). or the distribution of the fraction of energies of colliding particles \(\fancyscript{L}\bigl (x_{1},x_{2} \bigr )\) prior to hard collisions and prior to initial state radiation. The two functions are connected via the integral along the lines of constant centre-of-mass energies, given by Eq. (1). Therefore,

$$\begin{aligned} \fancyscript{L}\bigl (x \bigr ) = \int \int \limits _{0}^{~~x _{\max }}\!\mathrm {d}x_{1} \mathrm {d}x_{2} \,\delta \bigl (x- \sqrt{x_{1} x_{2}}\bigr )\fancyscript{L}\bigl (x_{1},x_{2} \bigr ). \end{aligned}$$
(2)

The luminosity spectrum affects all centre-of-mass energy dependent observables. For example, the luminosity spectrum has to be used to predict the inclusive (i.e., observed) cross-section \(\sigma _{\mathrm {Eff}}^{\mathrm {Machine}}\) at the machine. The principle is the same as for the parton density functions at hadron machines, except that the luminosity spectrum depends on the machine and not only on the colliding particles. To calculate the effective cross-section the differential cross-section is weighted with the luminosity spectrum, either with the one-dimensional luminosity spectrum

$$\begin{aligned} \sigma _{\mathrm {Eff}} ^{\mathrm {Machine}} =&\int \limits _{0}^{x _{\max }}\mathrm {d}x \; \fancyscript{L}\bigl (x \bigr ) \sigma \bigl (x \sqrt{{s_{\mathrm {nom}}}} \bigr ) , \end{aligned}$$
(3)

or for the two-dimensional luminosity spectrum

$$\begin{aligned} \sigma _{\mathrm {Eff}} ^{\mathrm {Machine}} =&\int \int \limits _{0}^{~~x_{\max }}\!\mathrm {d}x_{1} \mathrm {d}x_{2} \;\fancyscript{L}\bigl (x_{1},x_{2} \bigr ) \sigma \bigl (\sqrt{x_{1} x_{2} s_{\mathrm {nom}}}\bigr ). \end{aligned}$$
(4)

Bhabha scattering is the process of choice for luminosity measurements. It can be calculated with high precision and has a large cross-section. To first order, the differential Bhabha cross-section is [11]

$$\begin{aligned} \frac{\mathrm {d}\sigma _{\mathrm {Bhabha}}}{{\mathrm {d}\theta }} = \frac{2\pi \alpha ^{2}}{s}\frac{\sin {\theta }}{\sin ^{4}\left( \theta /2\right) }, \end{aligned}$$
(5)

where \(\alpha \) is the fine-structure constant and \(\theta \) the polar scattering angle.

Because cross-sections \(\sigma _{x}\bigl (\sqrt{s^{\scriptscriptstyle \prime }} \bigr ) \) depend on the centre-of-mass energy, any process used to reconstruct the basic luminosity spectrum will inherently contain a scaled luminosity spectrum

$$\begin{aligned} \fancyscript{L}_{\mathrm{scaled}}\left( {x}\right) = \frac{\sigma _{x}\bigl (x \sqrt{{s_{\mathrm {nom}}}} \bigr )}{\int _{x ^{\prime }_{\min }}^{x ^{\prime }_{\max }} \mathrm {d}x ^{\prime }\sigma _{x}\bigl (x ^{\prime }\sqrt{{s_{\mathrm {nom}}}} \bigr )} \fancyscript{L}\bigl (x \bigr ). \end{aligned}$$
(6)

This means that it is not enough to reconstruct the observed centre-of-mass energy spectrum, the luminosity spectrum has to be extracted from the observed spectrum.

The observed centre-of-mass energy is further affected by initial state radiation. It is impossible to distinguish between energy loss from initial state radiation and Beamstrahlung on an event-by-event basis. initial state radiation and Beamstrahlung have to be disentangled statistically.

Finally, the scattered particles are recorded in the detector, where their properties are reconstructed within the limits of the resolution of the respective sub-detectors.

2.1 The basic luminosity spectrum

The luminosity spectrum distribution can be seen as a convolution of the beam-energy spread, which is inherent to the accelerator, and the Beamstrahlung due to the Beam–Beam effects. Figure 1 shows the beam-energy spread of the 3 TeV CLIC machine. It is obtained from a simulation of the main linear accelerator and the beam delivery system [12].

Fig. 1
figure 1

Energy distribution of the CLIC beams [12]. a Dependence of the particle energy on the longitudinal position of the particles along the length of the bunch, where the beam travels towards the left. b The energy distribution of all particles

The energy of a particle depends on its longitudinal position in the bunch (Fig. 1a). Due to intra-bunch wakefields, particles in the front of the bunch gain more energy from the RF cavities than particles in the back of the bunch [13]. This leads to the two distinct peaks near the minimal and the maximal value of the beam-energy spread (Fig. 1b). The energy spread is not following a Gaussian distribution. The dependence of the particle energy on the longitudinal position also leads to larger correlations between the particle energies. The precise shape of the beam-energy spread depends on the RF-phase and the bunch length [13]. To avoid a loss in the luminosity, these parameters are not allowed to vary freely and have to be precisely controlled [10]. Therefore, it can be assumed that a limited knowledge of the shape of beam-energy spread is available.

The distribution of particles is used as the input to the beam-beam simulation. The simulation of the beam–beam effects is done with GuineaPig  [4]. During the bunch crossing the intense electromagnetic fields—due to the opposing bunches—deflect the beam particles and cause Beamstrahlung.

Figure 2 shows the full range and the region around the maximal energy of the two-dimensional luminosity spectrum. The square region in the distribution of the two energies is due to the beam-energy spread (see Fig. 1b). Events with \(x_{1} < 0.995\) or \(x_{2} < 0.995\) were significantly affected by the Beamstrahlung.

Fig. 2
figure 2

a Energy spectrum of colliding particles as simulated with GuineaPig for 3 TeV CLIC. b Zoom of the luminosity spectrum around the nominal beam energies

Figure 3a shows the basic luminosity spectrum with respect to the effective centre-of-mass energy \(\sqrt{s^{\scriptscriptstyle \prime }}\). The spectrum possesses a peak around the nominal centre-of-mass energy and a long tail down to less than 5 % of the nominal centre-of-mass energy. Figure 3b shows the peak of the luminosity spectrum as it is produced by GuineaPig. Because the beam-energy spread is not Normally distributed, the centre-of-mass energy peak is not Gaussian either. Figure 3b also shows a spectrum obtained by randomly pairing the energies of two particles, i.e., removing the correlation between the energies of the two beams. There is a clear difference between the two cases. If the correlation between the particle energies is not taken into account, the luminosity spectrum cannot be reconstructed properly.

Fig. 3
figure 3

a Luminosity spectrum for 3 TeV CLIC as simulated with GuineaPig. b The peak of the luminosity spectrum with and without correlated particle energies

To describe the beam-energy spread—and anchor the luminosity spectrum—the absolute energy of the beam has to be known. The average beam energy can be measured on a level of 0.04 % [10] with a dipole and beam position monitor in the beam delivery system of the accelerator. If the distribution itself can be measured as well is still under study.

2.2 Cross-section-scaled luminosity spectrum

The observed events are distributed according to the scaled luminosity spectrum (Eq. (6)), thus the events obtained from GuineaPig have to be either weighted with very large weights, or sampled with an accept–reject method [14] to obtain events with constant weights. Because large event-weights are undesirable, the accept–reject method is chosen.

The 3 TeV CLIC luminosity spectrum extends over more than three orders of magnitude of the Bhabha cross-section, meaning the accept–reject method is very inefficient. If a very large number of events for the basic luminosity spectrum were available, the scaled luminosity spectrum could be directly sampled from them. To avoid storing the large number of basic events the accept–reject method is directly added in GuineaPig.

The differential cross-section of the Bhabha scattering has to be known for the accept–reject method. Instead of using Eq. (5) to calculate the Bhabha cross-section, it is estimated with Bhwide  [15]. Bhwide includes higher-order effects and initial state radiation. Only events with the electron and positron polar angle inside the tracking acceptance (\(7^\circ <\theta <173^\circ \)) are accepted. The cross-section is estimated at precise centre-of-mass energies from 10 GeV to 3000 GeV without any luminosity spectrum. Figure 4 shows the cross-section as given by Bhwide.

Fig. 4
figure 4

Bhabha cross-section from Bhwide for electrons with a polar angle \(7^\circ < \theta < 173^\circ \), and the basic luminosity spectrum from GuineaPig (GP Basic), the luminosity spectrum scaled by bin-wise multiplication with a normalised Bhabha cross-section (GP \(\times \) cross-section), and the luminosity spectrum scaled with an accept–reject method in GuineaPig (GP Scaled). The lines of ‘GP \(\times \) cross-section’ and ‘GP Scaled’ are overlapping

Figure 4 shows the basic luminosity spectrum obtained with GuineaPig, the bin-wise multiplication of the luminosity spectrum with the cross-section, and the scaled luminosity spectrum from GuineaPig with the cross-section used in the accept–reject method. The last two curves are nearly identical showing that the modified GuineaPig produces a properly scaled luminosity spectrum with equally weighted events.

2.3 Generation of Bhabha events

The different luminosity spectra are used in the Bhabha generator to create the events which are observed in the detector. The Bhabha events are generated with Bhwide, where the energies of the initial electron and positron can be defined on an event-by-event basis, as implemented by Rimbault et al. [16]. The polar angle \(\theta \) of the final state electrons must be \(7^\circ < \theta < 173^\circ \) to ensure they will be observable in the tracker. Bhwide produces also Initial and Final State Radiation photons and accounts for their effects during the Bhabha scattering.

The cross-section—including the luminosity spectrum—for events with a centre-of-mass energy above 1.5 TeV is 11 \(\mathrm {pb}\), which results in more than 1 million events for an integrated luminosity of 100 \(\mathrm {fb}^{-1}\). The expectation for the 3 TeV CLIC is an integrated luminosity of 500 \(\mathrm {fb}^{-1}\) per year [17].

2.4 Observables and detector resolutions

Three observables, which can be extracted from the final state electrons, are used for the reconstruction of the luminosity spectrum: the relative centre-of-mass energy calculated from the polar angles of the outgoing electron and positron \(\sqrt{{s^{{\scriptscriptstyle \prime }}_{{\mathrm {acol}}}}}/\sqrt{{s_{\mathrm {nom}}}} \), the energy of the electron \(E_{1}\), and the energy of the positron \(E_{2}\). The relative centre-of-mass energy reconstructed from the acollinearity of the final state electrons is [8, 9]

$$\begin{aligned} \frac{\sqrt{{s^{{\scriptscriptstyle \prime }}_{{\mathrm {acol}}}}}}{\sqrt{{s_{\mathrm {nom}}}}} = \sqrt{\frac{\sin (\theta _{1})+\sin (\theta _{2})+\sin (\theta _{1}+\theta _{2})}{\sin (\theta _{1})+\sin (\theta _{2})-\sin (\theta _{1}+\theta _{2})}}, \end{aligned}$$
(7)

where \(\theta _1\) is the polar angle of the electron and \(\theta _2\) that of the positron.Footnote 3 \(\sqrt{{s^{{\scriptscriptstyle \prime }}_{{\mathrm {acol}}}}}\) is equal to the effective centre-of-mass energy \(\sqrt{s^{\scriptscriptstyle \prime }}\), if only one of the particles radiated photons – Beamstrahlung or Initial State Radiation. If both the electron and the positron radiated photons, the reconstructed centre-of-mass energy \(\sqrt{{s^{{\scriptscriptstyle \prime }}_{{\mathrm {acol}}}}}\) will be larger than \(\sqrt{s^{\scriptscriptstyle \prime }}\).

The Geant4 simulation of tens of millions of electrons is too time-consuming. To include the detector effects, resolution functions of the energy and angles have been obtained from fully simulated and reconstructed Bhabha events using the CLIC_ILD_CDR detector model [18]. The dominating beam-induced background, the \(\upgamma \upgamma \rightarrow \mathrm {hadron}\) events [19], was accounted for.

The rate of electrons produced in Bhabha scattering falls with an increasing polar angle \(\theta \) (cf. Eq. (5)) and the events will be predominantly at small polar angles. Because the magnetic field is nearly collinear to those tracks, their curvature does not allow for an accurate measurement of the momentum. Therefore, the energy is reconstructed using only the calorimeter information. The tracking information is used to measure the angles. The energy resolution is shown in Fig. 5a. It is modelled in the analysis with

$$\begin{aligned} \frac{\sigma _{E}}{E} = \frac{24.3\,\,\%}{\sqrt{E/\mathrm {GeV}}} \oplus 1.23\,\,\%, \end{aligned}$$
(8)

obtained from the reconstruction with overlaid CLIC 3 TeV \(\upgamma \upgamma \rightarrow \mathrm {hadron}\) background. The angular resolution needed for the computation of the relative centre-of-mass energy depends on the energy, shown in Fig. 5b. The angular resolution is better than 20 \(\upmu \mathrm {rad}\) for particle energies above 200 GeV.

Fig. 5
figure 5

Detector resolutions obtained through full reconstruction of Bhabha events with overlay of \(\gamma \gamma \rightarrow \mathrm {hadrons}\) events: a Energy resolution of outgoing electrons and resolution function according to Eq. (8). b Angular resolution for electrons

The distributions of the particle energies and the relative centre-of-mass energy are shown in Fig. 6. Figure 6a and b show the distributions before and Fig. 6c and d after the application of the resolution effects via four-vector smearing. The relative centre-of-mass energy is hardly affected by the resolution, due to the high angular resolution of the tracking detectors. The energy of the particles is much more affected by the detector resolution.

Fig. 6
figure 6

Energy of the final state electrons and relative centre-of-mass energy of the final state before (a, b) and after (c, d) application of the resolution functions

3 Modelling the luminosity spectrum

For the reconstruction of the basic luminosity spectrum with the reweighting fit, a model or parameterisation of the luminosity spectrum is needed. If the beam energy were not affected by beam-energy spread or Beamstrahlung, it could simply be described by a Dirac delta-distribution

$$\begin{aligned} E_{\mathrm {Beam}} (x) = \delta (x-1), \end{aligned}$$
(9)

with the random variates for this function \(x_{E_{\mathrm {Beam}}}=1\). The nominal energy is modified by the contributions from beam-energy spread \(\Delta {x^{}_{\mathrm {Spread}}} \) and Beamstrahlung \(\Delta x^{}_{\mathrm {Strahl}} \) which change the beam energy away from its initial value. The functions describing the two contributions will therefore describe the difference to the nominal value, and the final random variate will be

$$\begin{aligned} x_{\mathrm {Final}} = x_{E_{\mathrm {Beam}}} + \Delta {x^{}_{\mathrm {Spread}}} + \Delta x^{}_{\mathrm {Strahl}}. \end{aligned}$$
(10)

Thus, the functions for the beam-energy spread and Beamstrahlung should describe the energy change of particles due to the respective effect.

The luminosity spectrum model will be built by describing the two particle energies. Using \(x_{1}\) and \(x_{2}\) as defined in Sect. 2, the simplest description of a two-dimensional model is \(\fancyscript{L}\bigl (x_{1},x_{2} \bigr ) = f(x_{1})f(x_{2})\). However, a purely factorising Ansatz is insufficient to describe the correlation between the particle energies. Therefore, the two-dimensional energy distribution is divided into four regions (as shown in Fig. 7): one region where neither particle radiated Beamstrahlung (called the ‘peak’); two regions where one or the other particle radiated Beamstrahlung (called the ‘arms’); and one region where both particles radiated Beamstrahlung (called the ‘body’). This separation is only determined by whether a particle produced Beamstrahlung or not and ignores the beam-energy spread for the moment. The result of this division is a piecewise function

$$\begin{aligned} \fancyscript{L}\bigl (x_{1},x_{2} \bigr ) = {\left\{ \begin{array}{ll} f_{\mathrm {Peak}}, &{} \text {for } x_{1} =1 \mathrm {~and~} x_{2} =1 \\ f_{\mathrm {Arm1}}, &{} \text {for } x_{1} =1 \mathrm {~and~} x_{2} <1 \\ f_{\mathrm {Arm2}}, &{} \text {for } x_{1} <1 \mathrm {~and~} x_{2} =1 \\ f_{\mathrm {Body}}, &{} \text {for } x_{1} <1 \mathrm {~and~} x_{2} <1. \end{array}\right. } \end{aligned}$$
(11)

For each region, the resulting particle energies are described by a product of the functions for the two particles \(f_{\mathrm {Region}} = f_{\mathrm {Region}}^{1}(x_{1}) \cdot f_{\mathrm {Region}}^{2}(x_{2})\), and the individual functions are constructed from convolutions of the beam-energy spread and Beamstrahlung functions, depending on the region.

Fig. 7
figure 7

Two-dimensional luminosity spectrum where the different regions have been labelled. The beam-energy spread is included and in principle all the regions are overlapping

3.1 Parameterisation of the beam-energy spread

The function \(f_{\mathrm {Peak}}\) to describe the behaviour of the beam-energy spread (Fig. 8a) has to rise very steeply at the two extremities. A hyperbolic cosine, parabola, or higher order polynomials – with a reasonable number of parameters – do not describe this energy distribution well. A beta-distribution \(b \bigl (t\bigr ) =\frac{1}{N} t^{\omega ^{\mathrm {a}}_{}} (1-t)^{\omega ^{\mathrm {b}}_{}}\) is used to describe the beam-energy spread. As the beta-distribution is limited between \(0\) and \(1\), a variable transform

$$\begin{aligned} t = \frac{x-x_{\min }}{x_{\max }-x_{\min }} \end{aligned}$$
(12)

is used to describe the beam-energy spread between the two endpoints \(x_{\min } \) and \(x_{\max } \) near the maximal values at the beginning and end of the distribution. Here, the variable \(x\) is the relative difference between a particle’s energy \(E_{\mathrm {Particle}}\) and the nominal beam energy \(E_{\mathrm {Beam}}\),

$$\begin{aligned} x = \frac{E_{\mathrm {Particle}}-E_{\mathrm {Beam}}}{E_{\mathrm {Beam}}} = \frac{\Delta {E}}{E_{\mathrm {Beam}}}, \end{aligned}$$
(13)

which corresponds to \(\Delta {x^{}_{\mathrm {Spread}}} \) from Eq. (10). To also describe the particles with energies below the \(x_{\min } \) and above \(x_{\max }\), the beta-distribution is convoluted with a Gaussian distribution \(g\bigl (x\bigr )\) with a mean \(\mu =0\) and a width \(\sigma \). The Beam-Energy Spread (BES) function is

$$\begin{aligned} \mathrm {BES} \bigl (x; \omega ^{\mathrm {a}}_{}, \omega ^{\mathrm {b}}_{}, \sigma \bigr ) = b(x; \omega ^{\mathrm {a}}_{}, \omega ^{\mathrm {b}}_{}) \otimes g\bigl (x; \sigma \bigr ), \end{aligned}$$
(14)

where \(\otimes \) is the convolution operator \(h(x) \equiv \left( f \otimes g\right) (x) \equiv \int _{-\infty }^{\infty } f(\tau ) g(x-\tau ) \mathrm {d}\tau \). Due to Fubini’s theorem the convolution of two probability density functions always results in another probability density function [20].

Fig. 8
figure 8

a The beam-energy spread distribution from the accelerator simulation [12] and the best fit to the beam-energy spread with Eq. (14). b Energy spread after the simulation in GuineaPig. The distribution of the Peak requires that both particles have \(E > 0.995 E_{\mathrm {Beam}} \), and the distribution in the Arms requires that one particle has \(E < 0.995 E_{\mathrm {Beam}} \). The colour bands in both plots indicate the confidence interval at 99 %

The beam-energy spread histogram is fitted by the function \(\mathrm {BES} \bigl (x\bigr )\) with a binned log-likelihood fit with Root version 5.34.01 [21]. Figure 8a shows the best fit to the beam-energy spread with this model, and the resulting parameters are given in Table 1. The histogram contains 300 000 entries.

Table 1 Parameters found by the fit of Eq. (14) to the beam-energy spread from the accelerator simulation and to the beam-energy spread for two different regions of the luminosity spectrum

The width of the Gaussian \(\sigma \) and the boundaries of the beam-energy spread beta-distributions (\(x_{\min }, x_{\max } \)) are fixed for all following fits. This assumes an existing knowledge of the beam-energy spread coming from the accelerator. Fixing these parameters can introduce a large systematic error, if they are not measured correctly.

3.1.1 Luminosity weighted beam-energy spread

The correlation between the particle energy and its position in the bunch causes a change in the effective beam-energy spread. The probability to radiate Beamstrahlung photons, and therefore the fraction of energy lost by particles, increases with the distance travelled in the electromagnetic field of the oncoming bunch.

Figure 8b shows two energy distributions, one of the Peak-region, where both particles posses an energy of more than 99.5 % of the nominal beam energy, and one of the Arms-region, where only one of the particle contains more than 99.5 % of the nominal beam energy. Both histograms contain 300 000 entries.

The energy spread of the Peak-region is clearly flatter than the energy spread coming from the accelerator (Fig. 8). In the column Peak, Table 1 lists also the parameters \(\omega ^{\mathrm {a}}_{} \) and \(\omega ^{\mathrm {b}}_{} \) found by fitting Eq. (14) to the distribution. For this fit, the limits and the Gaussian width are fixed. For the best fit both \(\omega ^{\mathrm {a}}_{} \) and \(\omega ^{\mathrm {b}}_{} \) are closer to zero, but still negative, i.e., there are still two maxima at the lower and upper end of the distribution. The peak at the lower end of the spectrum is reduced, because the particles in the tail are less likely to interact with particles that did not radiate Beamstrahlung.

The energy spread of the Arms-region, where one of the particles radiated Beamstrahlung, shows a large peak at the lower energy, and almost no peak at the upper end of the spectrum. This is also caused by the correlation between the energy and the position in the bunch. Particles in the tail are more likely to collide with a particle that already radiated Beamstrahlung, and therefore the peak at the lower edge of the beam-energy spread is enhanced. Likewise, only very few particles with the highest energy—located near the front of the bunch—interact with particles from the tail of the bunch, which leads to the disappearance of the peak at the highest energies. The beam-energy spread for the Arms-region is described by a beta-distribution for which \(\omega ^{\mathrm {b}}_{} > 0\) (see column Arms in Table 1).

The \(\chi ^{2}/\mathrm {ndf} \) becomes larger for the fits to the luminosity weighted beam-energy spreads than for the fit to the initial beam-energy spread. The chosen function cannot perfectly model the distributions, however, the fits are only used to check qualitatively if the model can represent the luminosity spectrum at this stage. In addition, as there was only a single input file available to run GuineaPig, the macro-particles are re-used for luminosity events. This re-use means that the fluctuations in the number of entries are larger than what can be expected from the statistical uncertainties, which also increases the \(\chi ^{2}/\mathrm {ndf} \) value.

3.2 Beamstrahlung

Following Ohl’s Circe model [22], the energy distribution of the particles after the emission of Beamstrahlung photons is modelled with a beta-distribution. Beamstrahlung will always reduce the energy of a particle, so that the random variate \(\Delta x^{}_{\mathrm {Strahl}} \) would be between \(-1\) and \(0\) (cf. Eq. (10)). Beta-distributions are limited between 0 and 1, so that the function describing the Beamstrahlung effect is convoluted with the \(\delta \)-distribution from Eq. (9), which moves the range to \( 0 < x^{}_{\mathrm {Strahl}} = 1 + \Delta x^{}_{\mathrm {Strahl}} < 1\) and no further variable transform is necessary for the probability density function.

The parameters of the beta-distributions used to describe the energy distribution due to Beamstrahlung are called \(\eta ^{\mathrm {a}}_{} \) and \(\eta ^{\mathrm {b}}_{} \). The beta-distribution parameters must fulfil the conditions \( 0 < \eta ^{\mathrm {a}}_{} \) and \( -1 < \eta ^{\mathrm {b}}_{} < 0\) for the distribution to fall towards \(x=0\) and rise towards \(x=1\).

Previous studies by Daniel Schulte have shown that the tail of the CLIC centre-of-mass energy distribution is better modelled by a sum of three beta-distributions. Therefore, the energy distributions from Beamstrahlung are initially fitted by linear combinations of \(N_{\mathrm {Beta}} \) incomplete beta-distributions

$$\begin{aligned} b_{\mathrm {linear}}(x) = \sum _{i=1}^{N_{\mathrm {Beta}}} p_{i}b \bigl (x;[p]^{i},\beta _{\mathrm {Limit}}^{} \bigr ), \end{aligned}$$
(15)

and the constraint

$$\begin{aligned} 1 \overset{!}{=} \sum _{i=1}^{N_{\mathrm {Beta}}} p_{i}, \end{aligned}$$
(16)

where \(p_{i}\) are the respective fractions of the individual beta-distribution contribution and \([p]^{i}=\{\eta ^{\mathrm {a}}_{i},\eta ^{\mathrm {b}}_{i} \}\) the parameter-set for each beta-distribution. The beta-distributions are limited with an upper limit of \(\beta _{\mathrm {Limit}}^{} =0.995\). Above 0.955 the beam-energy spread is dominant and would have to be included for the fit.

Figure 9 shows the fits with \(N_{\mathrm {Beta}} =1, 2, 3\) to the distribution of the particle energy. In Fig. 9 the fit to the histogram is performed in the range of \(0.0 < x < 0.995\); It is visible that the function with three beta-distributions—with eight free parameters—shows a better agreement with the distribution than the other functions. As all the beta-distributions cover the full range for the fit, there are large correlations between the parameters of different beta-distributions.

Fig. 9
figure 9

Fit of the linear combination of one, two, and three beta-distributions (Eq. (15)) to the particle energy spectrum after Beamstrahlung. a Fit for \(0.0 < x < 0.995\), b fit for \(0.5 < x < 0.995\)

Figure 9 shows the same fit of linear combinations with a range limited to \( 0.5 < x < 0.995\); all three fit-functions overlap. Therefore, a single beta-distribution is enough to describe the particle energy between half and 99.5 % of the beam-energy. For the Model, the Beamstrahlung is described by a single beta-distribution to reduce the number of free parameters. However, this will also limit the energy range in which our Model can be considered as valid.

3.3 The Model for the full luminosity spectrum

The individual contributions discussed in the previous sections are now used to create the Model of the basic two-dimensional luminosity spectrum. As was discussed in Sect. 3.1 the beam-energy spread \(\mathrm {BES} \bigl (x\bigr )\) is described by a convolution of a Gaussian function \(g\bigl (x\bigr )\) and a beta-distribution \(b \bigl (x\bigr ) \)

$$\begin{aligned} \mathrm {BES} \bigl (x\bigr ) = \left( b \otimes g\right) (x). \end{aligned}$$
(17)

The beta-distribution for the beam-energy spread has a very narrow range. The particle energy distribution including the energy loss due to Beamstrahlung is described by a convolution of the beam-energy spread with an incomplete beta-distribution with the upper limit of \(\beta _{\mathrm {Limit}}^{\mathrm {Arm}} =0.9999\),

$$\begin{aligned} \mathrm {BB}\bigl ( x \bigr ) = \left( b \otimes \mathrm {BES} \right) (x). \end{aligned}$$
(18)

The upper limit is chosen to be close to 1, so that the convolution with beam-energy spread causes an overlap with the Peak-region (cf. Fig. 10).

Fig. 10
figure 10

Different parts of the Model, described by Eq. (20)

To describe particle energy distributions only negligibly affected by the beam-energy spread, a beta-distribution with an upper limit of \(\beta _{\mathrm {Limit}}^{\mathrm {Body}} =0.995\) convoluted with a Gaussian function is used

$$\begin{aligned} \mathrm {BG}\bigl ( x \bigr ) = \left( b \otimes g\right) (x). \end{aligned}$$
(19)

This upper limit separates the distribution from those more significantly affected by the beam-energy spread. This function is different from (17) due to the different ranges of the beta-distributions.

As described in Eq. (11), the distributions in the four different regions are described by the product of two functions, one for each particle. The explicit piecewise description shown in Eq. (11), however, is replaced by the use of delta-distribution and implicit ranges of the individual functions. The Peak region is described by two pure beam-energy spread functions (Eqs. (14) or (17)) and delta-distributions to signify the absence of Beamstrahlung; the Arms are modelled by one beam-energy spread function and a delta distribution, and one function describing Beamstrahlung convoluted with the beam-energy spread (Eq. (18)); the Body is described by two functions describing only the Beamstrahlung (Eq. (19)).

$$\begin{aligned} \fancyscript{L}\bigl (x_{1},x_{2} \bigr )&= p_{\mathrm {Peak}} \delta (1-x_{1})\otimes \mathrm {BES} \bigl (x_{1};[p]^{1}_{\mathrm {Peak}} \bigr ) \nonumber \\&\delta (1-x_{2})\otimes \mathrm {BES} \bigl (x_{2};[p]^{2}_{\mathrm {Peak}} \bigr ) \nonumber \\&+ p_{\mathrm {Arm1}} \delta (1-x_{1})\otimes \mathrm {BES} \bigl (x_{1};[p]^{1}_{\mathrm {Arm1}} \bigr ) \nonumber \\&\mathrm {BB}\bigl ( x_{2};[p]^{2}_{\mathrm {Arm1}},\beta _{\mathrm {Limit}}^{\mathrm {Arm}} \bigr ) \nonumber \\&+ p_{\mathrm {Arm2}} \mathrm {BB}\bigl ( x_{1};[p]^{1}_{\mathrm {Arm2}},\beta _{\mathrm {Limit}}^{\mathrm {Arm}} \bigr ) \nonumber \\&\delta (1-x_{2})\otimes \mathrm {BES} \bigl (x_{2};[p]^{2}_{\mathrm {Arm2}} \bigr ) \nonumber \\&+ p_{{\mathrm {Body}}} \mathrm {BG}\bigl ( x_{1};[p]^{1}_{{\mathrm {Body}}},\beta _{\mathrm {Limit}}^{\mathrm {Body}} \bigr ) \nonumber \\&\mathrm {BG}\bigl ( x_{2};[p]^{2}_{{\mathrm {Body}}},\beta _{\mathrm {Limit}}^{\mathrm {Body}} \bigr ) , \end{aligned}$$
(20)

with \(\beta _{\mathrm {Limit}}^{\mathrm {Arm}} =0.9999\), \(\beta _{\mathrm {Limit}}^{\mathrm {Body}} =0.995\). In addition, the constraint

$$\begin{aligned} p_{{\mathrm {Body}}} = 1 - p_{\mathrm {Peak}}- p_{\mathrm {Arm1}}- p_{\mathrm {Arm2}} \end{aligned}$$
(21)

has to be fulfilled, which results in

$$\begin{aligned} \int \fancyscript{L}\bigl (x_{1},x_{2} \bigr ) \mathrm {d}x_{1} \mathrm {d}x_{2} = 1, \end{aligned}$$
(22)

as required for a probability density function. The function given in Eq. (20) will be used to describe the luminosity spectrum. The random variates according to the individual parts of Eq. (20) are shown in Fig. 10. Each summand of Eq. (20) corresponds to one of the distributions. Due to the convolution of beam-energy spread and Beamstrahlung functions, the region around the nominal beam energies (\(x_{1} \approx 1\), \(x_{2} \approx 1\)) is described by a superposition of individual contributions.

4 Reweighting fit

The separate one-dimensional parts of the luminosity spectrum were fitted to the parts of the Model. Now the complete Model has to be fit to the two-dimensional spectrum.

It is possible to fit Eq. (20) to the basic luminosity spectrum. The convolutions with the \(\delta \)-distribution can be performed explicitly. The other convolutions have to be performed numerically, because the convolution between the beta-distribution and the Gaussian function cannot be expressed in a closed form.Footnote 4

For the implementation of the function the numerical convolutions are evaluated with the QAGFootnote 5 integration algorithm [23] interfaced via the GSLIntegrator from Root MathMore. The evaluation of the function takes about 160 s for the full range. A direct fit with the function, requiring multiple iterations, would be slow. The fitting procedure can be sped up by using a reweighting fit and by exploiting the fact that the random variates according to Eq. (20) can also be described by the sum of the random variates of the individual functions [24].

The principle of the reweighting technique is shown in Fig. 11.

Fig. 11
figure 11

Flowchart diagram of the reweighting fit procedure. The dashed arrows indicate the different event samples used for the different fits

A \(\chi ^{2}\) minimization, utilizing Minuit  [25] implemented in Root, is used to fit a sample of Model events to the GuineaPig sample. The procedure starts with the generation of a large number of events according to the Model. This produces events consisting of pairs of beam energies \((x_{1} ^{i}, x_{2} ^{i})\) and the corresponding probability \(\fancyscript{L}\bigl (x_{1} ^{i}, x_{2} ^{i}; [p]^{0}\bigr ) \) to obtain a given event. The probability depends on the initial set of parameter values \([p]^{0}\).

The minimizer is used to obtain a new set of parameter values \([p]^{N}\) that results in new probabilities \(\fancyscript{L}\bigl (x_{1} ^{i},x_{2} ^{i}; [p]^{N}\bigr ) \) for each event. The event weight

$$\begin{aligned} w^{i} = \frac{\fancyscript{L}\bigl (x_{1} ^{i},x_{2} ^{i}; [p]^{N}\bigr )}{\fancyscript{L}\bigl (x_{1} ^{i},x_{2} ^{i}; [p]^{0}\bigr )} \end{aligned}$$
(23)

is used to weight each event of the Monte Carlo distribution. For every set of parameter values \([p]^{N}\) a reweighted Monte Carlo distribution is obtained. The minimum \(\chi ^{2}\) between the distribution of the Model and the distribution from GuineaPig corresponds to the optimal parameter values matching the GuineaPig sample.

The \(\chi ^{2}\) between the two histograms is calculated from the number of entries in bin \(j \) of the GuineaPig sample \(N_{\mathrm {GP}} ^{j}\) and its uncertainty \(\sigma _{\mathrm {GP}} ^{j}\), the sum of the weights in bin \(j \) of the Model sample \(N_{\mathrm {Model}} ^{j}\), and the uncertainty \(\sigma _{\mathrm {Model}} ^{j}\), which are calculated from the event samples according to

$$\begin{aligned} \begin{aligned} N_{\mathrm {GP}} ^{j}&= \sum _{\text {GP Events i in Bin j}} 1\, , \\ \sigma _{\mathrm {GP}} ^{j}&= \sqrt{N_{\mathrm {GP}} ^{j}} , \\ N_{\mathrm {Model}} ^{j}&= \sum _{\text {Model Events i in Bin j}} w^{i} , \\ \sigma _{\mathrm {Model}} ^{j}&= \sqrt{\sum _{\text {Model Events i in Bin } j} (w^{i})^{2}}. \end{aligned} \end{aligned}$$
(24)

The \(\chi ^{2}\) to be minimized is then calculated with

$$\begin{aligned} \chi ^{2} = \sum _{\text {Bins j}} \frac{\bigl ( N_{\mathrm {GP}} ^{j} - f_{S} \cdot N_{\mathrm {Model}} ^{j} \bigr )^{2}}{(\sigma _{\mathrm {GP}} ^{j})^{2} + (f_{S} \cdot \sigma _{\mathrm {Model}} ^{j})^{2}}, \end{aligned}$$
(25)

where

$$\begin{aligned} f_{S} =\frac{\sum \nolimits _{\text {GP Events}}{1}}{\sum \nolimits _{\text {Model Events}~i}{w^{i}}} \end{aligned}$$
(26)

is a scaling factor that takes into account the difference in the sample sizes and the normalisation of the event weights due to the limited number of Model-events. The entire procedure has the advantage that only one sample of Model-events is needed, contrary to traditional template-fit procedures that require generating new Monte Carlo samples for every parameter-set.

However, by itself this requires even more evaluations of Equation (20)—one for every Monte Carlo event used in the generated sample—but the random variates according to this function can also be described by the sum of the random variates of the individual functions. The particle energy can be built up from the individual contributions

$$\begin{aligned} x_{\mathrm {Particle}} = x^{}_{\mathrm {Strahl}} + {x^{}_{\mathrm {Spread}}} + {x^{}_{\mathrm {G}}}, \end{aligned}$$
(27)

where \(x^{}_{\mathrm {Strahl}}\) is the random variate from the beta-distribution for the Beamstrahlung, \({x^{}_{\mathrm {Spread}}}\) the random variate from the beta-distribution for the beam-energy spread, and \({x^{}_{\mathrm {G}}}\) the random variate from the Gaussian-function of the beam-energy spread. Each random variate can be generated according to its probability density function. Equation (27) is connected to Eq. (10): \(x^{}_{\mathrm {Strahl}} =x_{E_{\mathrm {Beam}}}+\Delta x^{}_{\mathrm {Strahl}} \), and \({x^{}_{\mathrm {Spread}}} + {x^{}_{\mathrm {G}}} = \Delta {x^{}_{\mathrm {Spread}}} \). During the generation of events, the combination of functions is chosen according to the probability given by the parameters for each region \(p_{\mathrm {Peak/Arm1/Arm2/Body}}\). The probability for a particle’s energy in an event is given by the product of all individual probabilities

$$\begin{aligned} P(x^{}_{\mathrm {Strahl}},{x^{}_{\mathrm {Spread}}},{x^{}_{\mathrm {G}}}) = b \bigl (x^{}_{\mathrm {Strahl}} \bigr ) \cdot b \bigl ({x^{}_{\mathrm {Spread}}} \bigr ) \cdot g\bigl ({x^{}_{\mathrm {G}}} \bigr ), \end{aligned}$$
(28)

and the product of the probabilities for the individual particles multiplied by the probability for the region gives the probability for the event

$$\begin{aligned}&\fancyscript{L}\bigl (x^{1}_{\mathrm {Strahl}},{x^{1}_{\mathrm {Spread}}},{x^{1}_{\mathrm {G}}},x^{2}_{\mathrm {Strahl}},{x^{2}_{\mathrm {Spread}}},{x^{2}_{\mathrm {G}}} \bigr ) \nonumber \\&\quad \equiv p_{\mathrm {Region}} \cdot P(x^{1}_{\mathrm {Strahl}},{x^{1}_{\mathrm {Spread}}},{x^{1}_{\mathrm {G}}}) \cdot P(x^{2}_{\mathrm {Strahl}},{x^{2}_{\mathrm {Spread}}},{x^{2}_{\mathrm {G}}}).\nonumber \\ \end{aligned}$$
(29)

Thus Eq. (23) becomes

$$\begin{aligned} w^{i}&= \frac{p_{\mathrm {region}}^{N} b \bigl (x^{i,1}_{\mathrm {Strahl}},[p]^{N}\bigr ) b \bigl ({x^{i,1}_{\mathrm {Spread}}},[p]^{N}\bigr ) g\bigl (x^{i,1}_{\mathrm {G}},[p]^{N}\bigr )}{p_{\mathrm {region}}^{0} b \bigl (x^{i,1}_{\mathrm {Strahl}},[p]^{0}\bigr ) b \bigl ({x^{i,1}_{\mathrm {Spread}}},[p]^{0}\bigr ) g\bigl (x^{i,1}_{\mathrm {G}},[p]^{0}\bigr )}\,\nonumber \\&\quad \times \frac{b \bigl (x^{i,2}_{\mathrm {Strahl}},[p]^{N}\bigr ) b \bigl ({x^{i,2}_{\mathrm {Spread}}},[p]^{N}\bigr ) g\bigl (x^{i,2}_{\mathrm {G}},[p]^{N}\bigr )}{b \bigl (x^{i,2}_{\mathrm {Strahl}},[p]^{0}\bigr ) b \bigl ({x^{i,2}_{\mathrm {Spread}}},[p]^{0}\bigr ) g\bigl (x^{i,2}_{\mathrm {G}},[p]^{0}\bigr )}\nonumber \\ \end{aligned}$$
(30)

and no numerical convolutions have to be calculated.

The probability for obtaining energies \(x_{1}\) and \(x_{2}\) is not the same as the probability to obtain a specific group of variates, even if \(x^{1}_{\mathrm {Strahl}} +{x^{1}_{\mathrm {Spread}}} +{x^{1}_{\mathrm {G}}} \!=\!x_{1} \) and \(x^{2}_{\mathrm {Strahl}} \!+\!{x^{2}_{\mathrm {Spread}}} \!+\!{x^{2}_{\mathrm {G}}} \!=\!x_{2} \). There are many combinations of the variates \({x^{}_{\mathrm {Spread}}}\), \(x^{}_{\mathrm {Strahl}}\), and \({x^{}_{\mathrm {G}}}\), which can lead to the same \(x_{1}\) or \(x_{2}\). To estimate the probability for any given pair of energies \(\fancyscript{L}\bigl (x_{1},x_{2} \bigr )\) the convolutions have to be performed either numerically or via Monte Carlo generation.

4.1 Application of the reweighting fit to other distributions

The reweighting fit is also used to fit the distributions after the inclusion of the Bhabha scattering, Initial and Final State Radiation, and detector resolutions. The individual events are passed through the Bhabha Monte Carlo generator and detector simulation, which can be understood as additional convolutions of the existing distribution. As can be seen in Eq. (30), if the parameter governing one of the contributions does not change, the contribution does not affect the new weight. This enables the use of the reweighting fit also for the reconstruction of the spectrum from the Bhabha events, because the Bhabha scattering and detector resolutions \(\mathrm {D}\bigl (O_{k}\bigr ) \) are not varied during the fit. A measured distribution \(f\) of Observables \(O_{k}\) can be approximately written as

$$\begin{aligned} f(O_{1},O_{2},\ldots )&\approx \sigma (E_{1},E_{2};O_{1},O_{2},\ldots ) \times \fancyscript{L}\bigl (E_{1},E_{2}\bigr ) \nonumber \\&\quad \otimes \mathrm {ISR}\bigl (E_{1},E_{2}\bigr ) \otimes \mathrm {FSR}\bigl (O_{1},O_{2},\ldots \bigr ) \nonumber \\&\quad \otimes \mathrm {D}\bigl (O_{1}\bigr ) \mathrm {D}\bigl (O_{2}\bigr ) \ldots , \end{aligned}$$
(31)

where \(\sigma \) represents the centre-of-mass energy dependence of the Bhabha scattering and the observables, \(\mathrm {ISR}\bigl (E_{1},E_{2}\bigr )\) represents the probability for the energy distribution after Initial State Radiation, and \(\mathrm {FSR}\bigl (O_{1},O_{2},\ldots \bigr )\) represents the probability for the energy distribution after Final State Radiation. If the cross-section and detector resolutions are well enough known, the only difference between the measured and generated distributions is the luminosity spectrum. For this study the same Bhabha generator and detector simulations are used for both samples, so the additional effects are statistically the same. Any difference for the contributions can lead to a systematic error in the reconstruction of the luminosity spectrum.

4.2 Equiprobability binning

The \(\chi ^{2}\)-fit requires binned histograms. To obtain an unbiased estimator of the compatibility in a \(\chi ^{2}\)-fit, all bins should contain at least seven entries, and the number of events in all bins should be similar [26, p. 304]. These requirements can be fulfilled when an equiprobability binning is generated based on the respective GuineaPig sample used in the fits. With equal-size bins either a large number of bins could be used—where most would contain very few or no entries and would have to be rejected for the \(\chi ^{2}\) calculation—and the peak substructure could be resolved, or fewer bins with larger dimensions could be used, but then the peak could not be resolved. Therefore, the equiprobability binning can make better use of the available events.

Following the recipe of James [26, p. 305], the events are first evenly separated along one axis, and then all events falling in the range on this first axis are again evenly separated in the second axis. If additional dimensions are used, the separation is repeated. In this way each bin has different dimensions along each axis, but the number of events per bin is constant.

For the fit to the basic luminosity spectrum, as discussed in Sect. 5.1, the distribution of the two particle energies is stored in a two-dimensional histogram. For the reconstruction of the spectrum from the Bhabha events the energy of the scattered electron and positron, and the relative centre-of-mass energy reconstructed from the acollinearity \({\sqrt{{s^{{\scriptscriptstyle \prime }}_{{\mathrm {acol}}}}}}/{\sqrt{{s_{\mathrm {nom}}}}}\) are filled into a three-dimensional equiprobability histograms. Figure 12 shows examples for a two- and three-dimensional bin structure. It can be seen that around the nominal beam energies the size of the bins becomes smaller. Because the separation of events is done individually along each axis, the bin structures are not symmetric.

Fig. 12
figure 12

Example of the equiprobability binning in a two dimensions (zoomed to the peak region) b and three dimensions. The colours are arbitrary. By construction every cell contains a similar number of events

4.3 Uncertainty estimation

In order to ensure that the Model is unbiased and consistent, a large number of Model vs. Model fits were performed with a varying number of bins. In each case, the procedure is as follows: two sets of events are created according to the Model of the basic luminosity spectrum. The samples are then used in the fit procedure described before. In each fit a cut on the centre-of-mass energy of \(\sqrt{s^{\scriptscriptstyle \prime }} > 1.5~\mathrm {TeV}\) is applied.

The pull distribution of every free parameter is obtained and fitted with a Gaussian function. The Model is unbiased, if the mean of every pull distribution is close to zero. The uncertainty is correctly estimated, if the pull width is compatible with unity. This is called the Normality condition [26, p. 310].

For all parameters the pull distributions are independent of the binning. Most parameters are unbiased (i.e., the mean is zero). Exceptions are the \(\eta ^{\mathrm {a}}_{\mathrm {Body1}} \) and \(\eta ^{\mathrm {a}}_{\mathrm {Body2}} \) parameters, whose pulls are not normally distributed. These parameters describe the behaviour of the beta-distribution at the lower edge of the respective particle energy distribution. Therefore, the bias is caused by the cut on the centre-of-mass energy, which reduces the sensitivity to the lower energy part of the Body. The lower limit of these parameters is zero, which is often found by the minimizer instead of the nominal value. When the cut on the centre-of-mass energy is removed, the pulls are symmetric, and the parameters are correctly estimated. It is also found that the width of the Gaussian function is consistent with unity, so the uncertainties are correctly estimated by the minimization procedure.

5 Luminosity spectrum reconstruction

All ingredients for the reconstruction of the luminosity spectrum—the Model and the reweighting procedure—are now available.

The fits based on the basic luminosity spectrum (Sect. 5.1) are used to assess the similarity between the Model and the GuineaPig spectrum. The energies of the electron and positron pairs are filled into the two-dimensional equiprobability structure used in the reweighting fit. In the next step, the cross-section scaling, the Bhabha scattering, and the smearing for the detector-resolutions are applied and the reweighting fit is done with the observables defined in Sect. 2.4. The step-by-step inclusion of the intermediate steps and their impact on the reconstruction is detailed elsewhere [27].

The initial values of the parameters, used to generate the Model events, are given in Table 2. All the regions are chosen to start with a similar number of events (25 %). The starting \(\omega ^{}_{}\) parameters are taken from the fit to the beam-energy spread before the collisions (Table 1). The other parameters are chosen arbitrarily in a way to cause a behaviour similar to the GuineaPig luminosity spectrum. The position of the two boundaries for the beam-energy spread (\(x_{\min }\) and \(x_{\max }\)) are also taken from Table 1. Table 2 also lists the lower and upper bounds limiting the values for the minimizer.

Table 2 Initial parameter values used for the generation of the events. Also listed are the lower and upper bounds used in the reweighting fits

5.1 Fit to the basic luminosity spectrum

To verify that the Model can represent the basic luminosity spectrum from GuineaPig, the distribution of the initial particle energies are used in the \(\chi ^{2}\)-fit. The data histogram is shown in Fig. 2. The Monte Carlo sample is shown in Fig. 10. The GuineaPig sample consists of 3 million events and the Model provides 10 million events. Fits are done with a binning varied from \(50\times 50\) bins to \(300\times 300\) bins in steps of 10 bins. Only events with \(\sqrt{s^{\scriptscriptstyle \prime }} > 1.5~\mathrm {TeV}\) are used in the fit. The cut is applied because the Model has limited validity range, and the events below half the nominal centre-of-mass energy would have a negative impact on the fit result.

As an example for the result of the reweighting fit, Fig. 13 shows a small section of the histogram mapped onto one dimension and the pull distribution for all the bins before and after the fit. The data histogram has a constant number of events per bin, as designed by the equiprobability binning. The pull distribution after the fit converged is well centred around 0 with a width of 2.3. The width of the pull distribution is not equal to 1, because the \(\chi ^{2}/\mathrm {ndf} \) is larger than 1. This means that the Model is not completely identical to the GuineaPig distribution. Two of the differences are the limited number of beta-distributions used to model the tail of the spectrum (see Sect. 3.2), where deviations appear, and the differences in the peak of the spectrum (see Sect. 3.1), where a much larger number of parameters would be needed. As the \(\chi ^{2}/\mathrm {ndf} \) is not equal to unity, additional parameters would enable a better description of the spectrum.

Fig. 13
figure 13

a Blow-up of a small section of the bins used in the re-weighting fit of the initial electron energies. The histogram for GuineaPig (black) has, by construction, a constant number of events per bin. Also shown are the histograms for the Model with the initial parameter values before the fit and after the fit. b Distribution of the pulls \(\Omega ={\bigl (N_{\mathrm {GP}} ^{j} - f_{S} N_{\mathrm {Model}} ^{j} \bigr )}/{\bigl ((\sigma _{\mathrm {GP}} ^{j})^{2} + (f_{S} \sigma _{\mathrm {Model}} ^{j})^{2}\bigr )^{{1}/{2}}}\) (symbols are defined in Eqs. (24) and (26)) between the GuineaPig and Model samples before and after the re-weighting fit. A Gaussian function is fitted to the distribution of pulls after the fit

For one fixed binning, the fit to the luminosity spectrum was done 198 times with the same GuineaPig sample and independent Model samples. All the parameter values vary within their uncertainties. Therefore, the Model and the fit procedure are consistent.

5.2 Fits to the observables

The observables are defined in Sect. 2.4. BinningsFootnote 6 from \(10\times 10\times 10\) bins to \(80\times 50\times 50\) bins were used in the fits. The binning step is 5 bins for the relative centre-of-mass energy and 10 bins for the particle energies.

For the last step, the Bhabha events generated with the scaled luminosity spectrum are smeared with the detector resolutions as described in Sect. 2.4. The selection cuts had to be modified, and the cut is applied on the centre-of-mass energy calculated from the smeared four-vectors \(\sqrt{{s^{{\scriptscriptstyle \prime }}_{{\mathrm {4\text {-}vec}}}}} > 1.5~\mathrm {TeV}\) and in addition on the individual particle energies \(E_{1} > 150~\mathrm {GeV}\) and \(E_{2} > 150~\mathrm {GeV}\). To recover Final State Radiation, the energy of all photons in a 3\(^\circ \) cone around an electron is summed up.

5.3 Discussion of the results

For the two stages of the reconstruction multiple fits with different binnings were done. However, as the reconstructed spectra are fairly similar, only one reconstructed luminosity spectrum per stage is shown in detail. In addition, the parameter dependence on the number of bins is shown. For the fits to the basic and scaled luminosity spectrum the results with \(100\times 100\) bins are shown. For the reconstruction from the observables the fits with \(40\times 50\times 50\) bins are shown.

In Table 3 the \(\chi ^{2}/\mathrm {ndf} \) and parameters extracted by the selected fit stages are listed. The reconstructed parameters are far away from the initial values of the parameters (cf. Table 2), therefore the fit results are not artificially improved by using a good starting point. The final values of the beam-energy spread parameters \(\omega ^{}_{}\) are close to the values found by the one-dimensional fit to the beam-energy spread distributions detailed in Sect. 3.1. Because of the cut on the minimum centre-of-mass energy, the sensitivity on the lower Beamstrahlung parameter \(\eta ^{\mathrm {a}}_{} \) is lost, and both fits give a result of \(\approx 0\) with large uncertainty for these parameters. The reconstruction of the upper Beamstrahlung parameter \(\eta ^{\mathrm {b}}_{} \) is consistent.

Table 3 The parameter values found in selected fits to the initial electron and positron energies (first rows) and to the observables (second rows). The details of the fits are given in the text

The largest variation in the parameters is observed for the beam-spread parameters \(\omega ^{}_{}\). This increase is mostly due to the detector effects. In total the uncertainty increases by a factor ten, and the values are significantly different.

There are significant correlations between the parameters. The largest correlations are between parameters from the same beta-distribution. The correlations also increase when the additional effects are taken into account. Some of the changes of the parameter values could, therefore, be due to increased correlations.

Table 4 lists the fraction of events with a centre-of-mass energy larger than \(0.99\sqrt{{s_{\mathrm {nom}}}} \) from GuineaPig and from selected fits of the different fit stages. The uncertainty of the GuineaPig value is the statistical uncertainty from one million events. The uncertainty for the fits is calculated from the uncertainty of the individual parameters and accounts for the correlation between them.

Table 4 Summary of the fraction of events with \(\sqrt{s^{\scriptscriptstyle \prime }} > 0.99\sqrt{{s_{\mathrm {nom}}}} \) from GuineaPig and the reconstructed luminosity spectra from the different fit stages

The difference of the fractions between GuineaPig and the Model is less than one percentage point. Given the size of the uncertainties the difference is significant. However, processes with lower cross-section will effectively use smaller samples from the luminosity increasing the uncertainty to around one percentage point. The difference in the fraction of events in the top 1 % might therefore be insignificant for other measurements at 3 TeV.

The basic luminosity spectrum from GuineaPig compared with the reconstructed basic luminosity spectra from the two fit stages for the selected fits are shown in Figs. 14 and 16. For the ratios the green error bars show the statistical uncertainty for one million GuineaPig events and the barely visible red error bars show the uncertainty coming from the parameterisation.

Fig. 14
figure 14

Resulting spectra for the fit to the basic luminosity spectrum and a binning of \(100\times 100\). ac Basic luminosity spectrum from GuineaPig compared to the Model after the fit. The integral over the full range is normalized df Relative difference between GuineaPig and the Model after the fit. The horizontal error bars mark the statistical uncertainty from GuineaPig, the additional error bars mark the uncertainty from the parameters

In both cases the luminosity spectrum is reconstructed within 5  % between \(0.55\sqrt{{s_{\mathrm {nom}}}} \) and \(0.995\sqrt{{s_{\mathrm {nom}}}} \). Close to the peak, above \(0.995\sqrt{{s_{\mathrm {nom}}}} \), the beam-energy spread is the dominant effect and the difficulty of modelling this peak becomes visible. Still, this difference is seen only, when looking at small bin sizes (e.g., compare the bins around 1 in Fig. 14d or e with Fig. 14f). As Table 4 shows, the average fraction around the peak is reconstructed within one percentage point. Improved parameterisations should be able to better describe and reconstruct the shape of the peak, at the cost of longer run-time for the fit.

Below \(0.5\sqrt{{s_{\mathrm {nom}}}} \), the Model is much more inconsistent with GuineaPig, but this is given by the design of this Model and the cut on the centre-of-mass energy applied for the fits.

Some of the reconstructed parameter values depend on the number of bins used in the fit. Figure 15 shows the dependence of the reconstructed parameters on the number of bins used in the fit. Fits with a binning of \(50\times 50\) bins to \(300\times 300\) bins with the same number of events were done. In Fig. 15 the results are sorted by \(\chi ^{2}/\mathrm {ndf} \), or increasing number of bins. The Binning ID corresponds to the number of bins.

Fig. 15
figure 15

Parameter variation with respect to the Binning ID for the fits to the basic luminosity spectrum. The entries are sorted by falling \(\chi ^{2}\)

Fig. 16
figure 16

Resulting spectra for the fit to the observables with the scaled luminosity spectrum, detector resolutions, and a binning of \(40\times 50\times 50\). ac Basic luminosity spectrum from GuineaPig compared to the Model after the fit. The integral over the full range is normalized df Relative difference between GuineaPig and the Model after the fit. The horizontal error bars mark the statistical uncertainty from GuineaPig, the additional error bars mark the uncertainty from the parameters

The parameters \(\eta ^{\mathrm {b}}_{\mathrm {Arm1}} \) and \(\eta ^{\mathrm {b}}_{\mathrm {Arm2}} \), which represent the upper edge of the beam-energy spread of the Arms, show a significant dependence on the binning. For the other parameters the change is below one sigma. It is also visible that with more bins the parameter \(p_{\mathrm {Peak}}\) rises, while the two parameters \(p_{\mathrm {Arm1}}\) and \(p_{\mathrm {Arm2}}\) fall, which is also visible in the correlation matrix and their correlation coefficient of about \(-0.4\).

Figure 17 shows the parameters obtained in the fit to the observables. The results are again sorted by decreasing \(\chi ^{2}/\mathrm {ndf} \), which defines the Binning ID. In the figure the different markers give the number of bins used for the energy observables. As the \(\chi ^{2}/\mathrm {ndf} \) falls with increasing number of bins the larger the Binning ID the large is also the number of bins used for the relative centre-of-mass energy observable.

Fig. 17
figure 17

Parameter variation with respect to the Binning ID for the fits to the observables with the scaled luminosity spectrum and including detector resolutions. The entries are sorted by decreasing \(\chi ^{2}\) and the colours and markers give the number of bins used for the energy observables

The parameter values depend much stronger on the number of bins. This is mostly due to the inclusion of the detector resolutions. Without a minimum number of bins the peak structure cannot be resolved, and the \(\omega ^{}_{}\)-parameters are completely different from the previous results and show large fluctuations in their values. If a large enough number of bins is used, the results are only a few sigma different from the previous fit results. The detector resolutions have a strong impact on resolving the structure of the luminosity peak.

6 Systematic impact on smuon mass measurement

There are significant differences between the reconstructed luminosity spectrum and the one from GuineaPig when looking at large event samples. Typical cross-sections for New Physics phenomena will be much smaller than that of Bhabha scattering, and the luminosity spectrum sampled for a specific process will therefore have larger statistical fluctuations, so that the difference between the reconstructed and actual spectrum might not be significant. To estimate the impact of the difference between GuineaPig and the reconstructed spectrum, the measurement of the smuon mass \(m_{\tilde{\upmu }^{\pm }}\) and neutralino mass \(m_{\tilde{\upchi }^{0}}\) from smuon pair production is used. In this model the masses are \(m_{\tilde{\upmu }^{\pm }} =1011~\mathrm {GeV}\) and \(m_{\tilde{\upchi }^{0}} =340~\mathrm {GeV}\).

The smuon decays into a muon and a neutralino, so that the energy spectrum of the muons \(f(E_{\upmu })\) can be used to extract the smuon and neutralino masses. The details of the analysis are described elsewhere [28], here only the parts directly concerning the systematic uncertainty from the luminosity spectrum are repeated. There are some differences in the treatment of the statistical uncertainty between the version of the fitting program used here, and the one used in the original paper.

In an ideal situation—with a single centre-of-mass energy \(\sqrt{{s_{\mathrm {nom}}}}\)  —the muon energy spectrum is a uniform distribution \(\mathrm {U}\bigl (E_{\upmu } \bigr ) \) with the boundaries [29]

$$\begin{aligned} E_{\mathrm {H,L}} = \frac{\sqrt{{s_{\mathrm {nom}}}}}{4} \left( 1 - \frac{m_{\tilde{\upchi }^{0}} ^{2}}{m_{\tilde{\upmu }^{\pm }} ^{2}} \right) \left( 1 \pm \sqrt{1 - 4 \frac{m_{\tilde{\upmu }^{\pm }} ^{2}}{s_{\mathrm {nom}}}}\;\right) . \end{aligned}$$
(32)

The uniform distribution therefore depends on the smuon and neutralino masses.

In reality, there is not a single centre-of-mass energy, and for every centre-of-mass energy the uniform distribution has different limits. Therefore, the measured muon-energy spectrum is affected by the basic luminosity spectrum, the initial state radiation, the cross-section, and the detector resolution \(\mathrm {D}\bigl (E_{\upmu } \bigr )\). The luminosity spectrum \(\fancyscript{L}\bigl (x \bigr )\), initial state radiation \(\mathrm {ISR}\bigl (x \bigr )\), and cross-section \(\sigma _{\tilde{\upmu }\tilde{\upmu }}\bigl (\sqrt{s} \bigr )\) can be combined into the number of events per centre-of-mass energy \(N_{\mathrm {eff}}(x)\). The initial state radiation and luminosity spectrum are convoluted and the resulting function is multiplied with the smuon-pair production cross-section and with the total integrated luminosity \(L_\mathrm {int}\)

$$\begin{aligned} N_{\mathrm {eff}}(x) = L_\mathrm {int} \cdot \Bigl (\fancyscript{L}\otimes \mathrm {ISR}\Bigr )\bigl (x \bigr )\cdot \sigma _{\tilde{\upmu }\tilde{\upmu }}\bigl (x \sqrt{{s_{\mathrm {nom}}}} \bigr ). \end{aligned}$$
(33)

The initial state radiation function \(\mathrm {ISR}\) describes the distribution of the energy after the radiation of initial state radiation. In this case, the distribution is obtained from the Monte Carlo generator used to generate the smuon events. It is here assumed to be independent of the nominal centre-of-mass energy. The function to fit the muon energy spectrum is then the convolution of the uniform energy spectrum with the detector resolution weighted by the respective number of events

$$\begin{aligned} f(E_{\upmu })&= \int \limits _{0}^{\infty }\;N_{\mathrm {eff}}(x) \times \int \limits _{E_\mathrm {L}\bigl (\sqrt{s^{\scriptscriptstyle \prime }} \bigr )}^{E_\mathrm {H}\bigl (\sqrt{s^{\scriptscriptstyle \prime }} \bigr )} \mathrm {U}\bigl (m_{\tilde{\upmu }^{\pm }},m_{\tilde{\upchi }^{0}},x \sqrt{{s_{\mathrm {nom}}}},\tau \bigr ) \nonumber \\&\quad \times \mathrm {D}\bigl (E_{\upmu }-\tau \bigr ) \;\mathrm {d}\tau \;\mathrm {d}x. \end{aligned}$$
(34)

Figure 18a shows the background-subtracted signal sample and an example fit with Eq. (34). To estimate the impact of the reconstruction, the fit results when the luminosity spectrum is taken directly from GuineaPig are compared with those, when the spectrum is coming from the reconstruction.

Fig. 18
figure 18

a Background subtracted signal sample and the best fit to extract the smuon and neutralino mass. b Reconstructed masses with the luminosity spectra taken from the fits to the Bhabha observables with different binnings. The dashed lines mark the result obtained with the GuineaPig spectrum

The masses extracted from the fit with Eq. (34) become a function of the parameters \(\mathbf {p}\) from the spectrum reconstruction \(m = m(\mathbf {p})\) with the luminosity spectrum reconstructed from the Bhabha events. To estimate the systematic uncertainty due to the reconstruction of the spectrum, the fit is performed with the nominal set of parameters \(\mathbf {p}\) and with each parameter \(p_{i}\) increased or decreased by half of a standard deviation \(\sigma _{p_{i}}\)

$$\begin{aligned} m^{+}_{i} = m\left( \mathbf {p} + \mathbf {e_{i}}\frac{\sigma _{p_i}}{2} \right) ,\qquad m^{-}_{i} = m\left( \mathbf {p} - \mathbf {e_{i}}\frac{\sigma _{p_i}}{2} \right) . \end{aligned}$$
(35)

The systematic uncertainty on the fitted value is then given by

$$\begin{aligned} \sigma _{m} = \left( \sum _{i,j} \delta _{i} C_{ij} \delta _{j}\right) ^{1/2} \end{aligned}$$
(36)

with \(\delta _{i} = m^{+}_{i} - m^{-}_{i} \), and the correlation matrix \(C\).

Table 5 lists the smuon and neutralino masses from the fit when the luminosity spectrum in Eq. (33) is directly taken from GuineaPig and when the luminosity spectrum is obtained from the reconstruction with the observables with the scaled luminosity spectrum and detector resolutions with a binning of \(50\times 40\times 40\) bins. The difference in the reconstructed masses for these two luminosity spectra is smaller than the statistical uncertainty. However, as the reconstructed luminosity spectrum shows a dependence on the binning, so do the reconstructed masses. Figure 18b shows the reconstructed masses for the spectra reconstructed with different binnings. There is a dependence of the reconstructed masses on the number of bins, but the spread of the reconstructed masses is smaller than the statistical uncertainty (cf. Table 5).

Table 5 Extracted smuon and neutralino masses from the fits to the signal sample using different (effective) luminosity spectra

As the difference between the obtained masses and the spread of masses is smaller than the statistical uncertainty, the reconstruction of the luminosity spectrum does not introduce a significant bias compared with the statistical uncertainty. The systematic uncertainty due to the luminosity spectrum reconstruction is also much smaller than the statistical uncertainty, so that the total uncertainty on the reconstructed mass is not increased significantly.

7 Summary, conclusions, and outlook

A framework has been developed for the reconstruction of the basic luminosity spectrum at future linear colliders. The spectrum can be reconstructed from Bhabha events measured with the tracking detectors and calorimeters. All important effects were included: the luminosity spectrum from beam-beam simulations—including the non-Gaussian CLIC beam-energy spread—the \(\sqrt{s^{\scriptscriptstyle \prime }}\)-dependence of the Bhabha cross-section, Initial and Final State Radiation, and the detector resolutions.

The Model of the 3 TeV CLIC luminosity spectrum, required for the reweighting fit, has some limitations. For technical reasons the energy range to describe the tail of the Beamstrahlung is limited to \(\sqrt{s^{\scriptscriptstyle \prime }} > 1500~\mathrm {GeV}\), and the peculiar beam-energy spread cannot be modelled precisely with few parameters. The reweighting fit itself does not impair the reconstructed spectrum. The differences between GuineaPig and the reconstructed spectrum do not significantly change between the fit to the basic luminosity spectrum and the fit to the observables with the scaled luminosity spectrum and including detector resolutions. With an improved model, and increased processing power, an improved reconstruction of the CLIC 3 TeV spectrum should be possible.

The fraction of events above 99  % of the nominal centre-of-mass energy is reconstructed within 1 percentage point. The centre-of-mass energy distribution is reconstructed to better than 5  % between the nominal and about half the nominal centre-of-mass energy, the validity limit of our Model. These results are obtained regardless of the included level of details, so that one can conclude that the limitations of the Model cause most of the discrepancies to the simulated spectrum, and if a better model is used, the discrepancies should be reduced.

To estimate the systematic impact on other physics measurements, the reconstructed spectrum was used in the study of smuon decays, one of the CLIC 3 TeV benchmark processes. The reconstructed spectrum does not induce a significant bias on the measured mass, nor does it cause a significant systematic uncertainty. The systematic uncertainty from the spectrum reconstruction is two orders of magnitude smaller than the statistical uncertainty.

The spectrum is well enough reconstructed for the chosen physics channel. In this case a good reconstruction of the tail of the spectrum is tested. The reconstruction of the peak is less important, because the process is far above threshold and the cross-section does not change significantly over the peak region. More work is needed to evaluate and possibly improve the reconstruction of the peak.

7.1 Outlook

The framework can also be applied for the reconstruction of the luminosity spectrum at other centre-of-mass energies and linear electron–positron colliders than CLIC. Depending on the beam-energy spread and the demanded range of the reconstruction, the Model has to be adapted, but this will not increase the computational complexity of the reconstruction.

The energy range of the current Model can be increased by replacing the single Beamstrahlung beta-distributions by linear combination of beta-distributions. Improving the description of the beam-energy spread is less obvious without a large increase in the number of parameters.

The boundaries of the beam-energy spread—the parameters \(x_{\min }\) and \(x_{\max }\)—were fixed during the fit. It should be evaluated how much the measurement is affected, when these parameter values differ from those of the beam-energy spread. It should also be tried to vary the boundaries of the beam-energy spread during the re-weighting fit. For varying these parameters during the reweighting fit the initial samples have to be produced with overlapping regions. For example, the peak region would be produced with an \(x_{\min }\) smaller than the upper limit of the arm or body regions. During the re-weighting the value for \(x_{\min }\) or \(x_{\max }\) would be given by the minimizer, and events in the peak below \(x_{\min }\) or above \(x_{\max }\) would be dropped, as would events in the arm or body above their respective upper limit.

The observables from the Bhabha events can also be exchanged for other suitable choices, always keeping the detector resolutions in mind. The impact of the detector resolutions on the reconstructed spectrum can be easily studied by changing the resolutions used in the four-vector smearing. The same detector resolutions and Bhabha generator were used for the GuineaPig and Model events. Differences in the predicted detector resolution and Bhabha scattering to the actual events can introduce systematic errors into the reconstruction. These effects could be studied by varying the detector resolutions or the Bhabha cross-section independently for the two samples used in the fit.

Only Bhabha events—and no other physics processes—were considered. It should be checked if multi-peripheral two-photon events, in which the spectator electrons scatter at large angles, are a background.

As the luminosity spectrum depends on the accelerator, the impact of possible variations of the beam parameter on the reconstruction of the luminosity spectrum should be studied with realistic variations of the beam parameters.