Abstract
In this paper, the experimental results of the recent dynamic aperture at top energy for the CERN Large Hadron Collider are analysed by means of a diffusion model whose novelty consists of deriving the functional form of the diffusion coefficient from Nekhoroshev theorem. This theorem provides an optimal estimate of the remainder of perturbative series for Hamiltonian systems. As a consequence, a three-parameter diffusion model is built that reproduces the experimental results with a high level of accuracy. A detailed discussion of the physical interpretation of the proposed model is also presented.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
1 Introduction
The dynamic aperture (DA) is the amplitude of the phase space region where stable, i.e. bounded, motion occurs. DA is one of the key quantities for the design of modern colliders based on superconducting magnets, which feature unavoidable non-linear field errors, such as the Tevatron [1,2,3], HERA [4,5,6,7], RHIC [8], the Superconducting Super Collider (SSC) [9, 10], and LHC (see, e.g. Ref. [11] for a detailed overview).
The study of transport in the phase space of non-integrable Hamiltonian systems is a very difficult problem due to the coexistence of weakly chaotic regions and invariant Kolmogorov–Arnold–Moser (KAM) tori [12] that implies a sensitive dependence of the orbit evolution on the initial conditions. The relevance of Arnold diffusion [13], a generic phenomenon in Hamiltonian systems with two or more degrees of freedom, in applications is still debated.
Macroscopic physical systems cannot realise the symplectic character of the dynamics at arbitrary spatial and time scales. Nonetheless, some results of Hamiltonian perturbation theory turn out to be robust with respect to the details of the considered system and they can provide effective laws for the study of stability and diffusion problems of the orbits. Nekhoroshev theorem [14] is an excellent example of such a result, the corresponding estimate for the orbit stability time being applied in several fields ranging from celestial mechanics to accelerator physics, where in recent years, a connection between Nekhoroshev theorem and time variation of the dynamics aperture has been established [15].
In a mathematical sense, the stability property requires an arbitrarily large time scale. In a physical context, however, particle stability can be linked to a maximum number of turns \(N_{\mathrm{max}}\) that is determined on the basis of the specific application. Let (x, y) be the transverse spatial coordinates describing the betatronic motion in a collider, if an ensemble of initial conditions defined on a polar grid (\(x = r \, \cos \theta \, , \,\, y = r \, \sin \theta \,\,\, 0 \le \theta \le \pi /2\), where x, y are expressed in units \(\sigma _x, \sigma _y\) of the beam dimension) is tracked for up to \(N_{\mathrm{max}}\) turns, then a measure of the DA can be defined as [15]:
where \(r(\theta ; N)\) stands for the last stable amplitude in the direction \(\theta \) for up to N turns. Note that in case the stable phase space region is made of disconnected parts, only the area surrounding the origin is retained in these computations. In this way, the DA can be considered a function of N, with an asymptotic value, when it exists, representing the DA for an arbitrary large time.
An accurate numerical computation of DA, as well as a good estimate of the numerical error associated with the numerical protocol used is of paramount importance to ensure the reliability of DA as a figure-of-merit for assessing synchrotron performance. A general discussion of the DA definition, its computation, and accuracy can be found, e.g. in Ref. [15].
DA computation requires the determination of the evolution of a large number of initial conditions, distributed to provide good coverage of the phase space under study, to probe whether their motion remains bounded over the selected time interval. While the computational burden of a large set of initial conditions can be easily mitigated by means of parallelism [16], it is not possible to mitigate the heavy CPU power needed for long-term simulations. Hence, studies have explored the possibility to describe the DA dependence on the number of turns using simple models [17, 18]. The underlying idea is that long-term behaviour of the DA can be extrapolated using knowledge from numerical simulations performed over a smaller number of turns. Additionally, a more efficient estimate of the long-term behaviour of the DA would expedite analysis of several configurations of the circular accelerator, which is sometimes mandatory to gain insight into the deeper nature of the beam dynamics.
The Nekhoroshev [14] theorem suggests an answer to the quest for modelling the time evolution of DA. In fact, according to the results of Refs. [17, 18], the following scaling law holds:
where \(D_{\infty }\) represents the asymptotic value of the amplitude of the stability domain, b and \(\kappa \) being additional parameters.
The model (2) gives the following rough description of the transverse phase space, in which we distinguish three macroscopic regions: an inner central core around the origin \(r < D_{\infty }\), where the measure of the KAM [12] invariant tori is large, thus producing a stable behaviour apart from a set of a very small measure where Arnold diffusion can take place; a surrounding region, with \(r > D_{\infty }\), where a weak chaos is present and the escape rate is reproduced by a Nekhoroshev-like estimate [14, 19, 20]; an outer region where most of orbits escape quickly towards the infinity. In the region \(r > D_{\infty }\) the model (2) provides an estimate of the stability time as a function of the amplitude r of the form:
where N(r) is the number of turns that are estimated to be stable for particles with initial amplitude smaller than r, and \(r_*\) is a positive parameter.
According to the scaling law (2), it has been proposed a model for the evolution of beam intensity in a hadron synchrotron [22], which is the basis of the novel experimental method used to probe DA.
In this paper, we use a diffusive approach to reproduce the experimental results from the recent DA experiment at top energy in the LHC. The beam dynamics in the weakly chaotic region is governed by a stochastically perturbed Hamiltonian system, which in turns is described by means of a Fokker–Planck (FP) equation [21], whose solution represents the average evolution of the beam distribution, including also absorbing boundary conditions. This approach allows the diffusion process for the particle distribution to be simulated, providing a natural description of the beam dynamics in the presence of a collimation system, which is a typical situation in colliders based on superconducting magnets.
The novelty of the proposed approach consists of using the remainder estimate of the perturbative series from Nekhoroshev theorem as functional form for the diffusion coefficient of the FP equation. Indeed, Nekhoroskev approach to the optimal estimate of the remainder of perturbative series [20] represents the link between the analysis of the beam dynamics based on the scaling law of DA and that based on a diffusion equation: for the first the theorem provides the form of the scaling law of D(N), while for the latter the theorem provides the form of the diffusion equation.
The plan of the paper is the following: in Sect. 2 the main aspects of the theory of diffusion processes in stochastically perturbed Hamiltonian system are reviewed, while in Sect. 3 the experimental technique is described. The main results of our analysis are presented and discussed in Sect. 4, where a detailed comparison between the theoretical approach based on the diffusion equation and the experimental measures is presented. In Sect. 5 the phase space of the system under consideration is studied by means of symplectic-tracking simulations to provide a confirmation of the assumptions used for the diffusive approach. Some conclusions are drawn in Sect. 6, whereas the mathematical details of the proposed approach are presented in Appendices A–C.
2 Theoretical background
The results of perturbation theory of Hamiltonian systems imply that when the set of invariant KAM tori in phase space has a large measure, the orbits’ diffusion is possible only for a set of initial conditions of extremely small measure [23]. Therefore, the existence of macroscopic diffusion phenomena in phase space has to be related to the presence of weak chaotic regions of large measure in which the large majority of KAM tori are broken [24]. Note that in realistic models of betatron motion, slow modulation of the strength of lattice elements, transverse tune ripple induced by synchrotron motion, or weak stochastic effects, such as noise in active devices, may lead to the appearance of such regions.
Nekhoroshev’s theorem provides optimal estimates for the remainders of the asymptotic perturbative series for Hamiltonian flows, however, also in the case of a symplectic map in the neighbourhood of an elliptic fixed point, it is possible to provide an optimal estimate for the Birkhoff normal forms series [19, 20].
Let I be the unperturbed action, there exists an optimal perturbation order of the Birkhoff’s expansion at which the remainder is estimated according to:
where \(I_*\) represents an apparent radius of convergence of the perturbative series and the exponent \(\kappa \) depends on the number of degrees of freedom of the system under consideration. We conjecture that under generic conditions the functional form of Nekhoroshev estimate (4) can be applied to measure the strength of the chaotic component of the dynamics in the weak chaotic region. Assuming a diffusion approach for the evolution of the action distribution (see Appendices for the mathematical details) in the one-dimensional case, the Fokker–Planck equation holds:
where \(\varepsilon \) is a scaling factor related to the perturbation amplitude. The Nekhoroshev’s estimate suggests that the following functional form for the action-diffusion coefficient
is suitable to simulate the action diffusion under the previous assumptions. The constant c is computed by normalising the diffusion coefficient according to:
where \(I_{\mathrm{abs}}\) represents the position of the absorbing boundary condition. The physical meaning of the parameters \((\varepsilon , \kappa , I_*)\) that characterise the diffusion model (5) and (6) is readily derived from Nekhoroshev’s theorem: (1) \(\varepsilon \) is an adimensional quantity that measures the strength of the non-linear effects acting on the beam; (2) the exponent \(\kappa \) emerges from the analytic structure of the perturbative series and it mainly depends on the phase space dimensionality and on the nature of the non-linear terms that occur in the perturbative series, independently from their strength; (3) \(I_*\) reveals the asymptotic character of the perturbative series and it is related to the strength of non-linear terms. Usually, the region in phase space corresponding to \(I\simeq I_*\) is beyond the short-term dynamic aperture, where our approximation is no more valid. It is worthwhile mentioning that the parameters \(\varepsilon \) and \(I_*\) are in principle correlated since a scaling in the action changes the strength of the perturbation. However, the position of the absorbing barrier is invariant with respect to the global time scaling \(\varepsilon \), whereas it depends on the action scaling.
In Fig. 1, we plot the behaviour of the diffusion coefficient (6) (upper) using parameter values relevant for the comparison with experimental data (see Sect. 4) and we show an example of the numerical solution of the FP equation (5) (lower) with exponential initial distribution and an absorbing boundary condition.
One clearly sees the effects of the functional form of the Nekhoroshev diffusion coefficient: after a rather fast initial diffusion, the evolution of the beam distribution slows down. Moreover, the changes to the initial shape of the distribution are limited to its tails. This is the reason for the existence of a stable region of finite extent in phase space for finite time, which would give rise to a finite dynamic aperture.
3 LHC dynamic aperture experiment at top energy
With the advent of the LHC and the approval of its high-luminosity upgrade [25], the topic of measuring the DA by means of beam experiments has regained interest, after a break between the design phase of the LHC (see, e.g. Refs. [27,28,29,30] for a review of the comparison between measurements and simulations), and its commissioning and following operation periods.
DA measurements at the LHC (see Fig. 2, upper, for a layout of the LHC ring) have been already carried out at injection energy [26, 31, 32] using different approaches, i.e. the standard kick method [26] or the new approach [31, 32]. For the latter, the technique consists of blowing up both the horizontal and vertical emittances until beam losses can be detected. The beam intensity as a function of time is then recorded, fitted, and compared with the results of numerical simulations, usually showing a very nice agreement [33]. During these measurements, the strength of non-linear elements located in the regular cell of the accelerator (see Fig. 2, middle, for a layout of the LHC cell, including also the non-linear correctors used) is varied to provide several machine configurations to be studied by means of numerical simulations. The encouraging results obtained at injection energy suggested to pursue the DA measurement at flat top energy.
The goals of the DA measurements performed at 6.5 TeV in the LHC were manyfold: the use of squeezed optics allows probing the impact on beam dynamics of the non-linear field errors stemming from the quadrupoles in the high-luminosity insertions. Thus, one could examine and quantify the influence on beam loss and lifetime from changes in the strength of the normal dodecapole correctors (see Fig. 2, bottom, for a sketch of the high-luminosity insertions, whose magnets were used during the experiment) in the ATLAS and CMS interaction regions (IR) 1 and 5, respectively. This aspect is particularly relevant in view of the future High Luminosity LHC project [25], for which the operational strategy to set the non-linear correctors in the high-luminosity IRs is still to be studied. Moreover, in previous studies, the beam emittance was heated to large values in both horizontal and vertical planes. While this can be considered a benefit of the method insofar as it gives a measure of changes to the average DA over all angles in the x–y plane, it may also be regarded as a limitation since with such an approach it is not possible to distinguish between changes of the horizontal and vertical dynamic aperture. To help rectify this aspect, it was decided to measure the dynamic aperture simultaneously for three bunches in a single beam. One bunch was heated horizontally (H blow up, in the following), one vertically (V blow up, in the following), and one in both planes (H–V blow up, in the following). Note that a witness bunch of small transverse emittance provides a reference case. The key objective of these measurements was related to the time scale achieved. Typical DA simulations are performed over \(10^{5}\)–\(10^{6}\) turns (\(\sim 8\)–88 s of LHC operation) and previous measurements have been performed on the 5–10 min time scale [31, 32]. Operational time scales at top energy in the LHC, by contrast are of the order of \(\sim 12\) h. To justify the extrapolation of simulated data that can be viably studied numerically to orders of magnitude longer times, it is also necessary to establish whether the analytical scaling laws hold over these same time scales. Thus the final objective of this novel measurement campaign was to perform dedicated dynamic aperture measurements on the time scale of an hour, significantly longer than any previous measurement in the LHC.
The experiment was performed using both the clockwise beam (Beam 1) as well as its counter-clockwise partner (Beam 2). The first was made of a single bunch blown up in both horizontal and vertical planes, while the latter comprised four bunches with different emittance blow as mentioned earlier. The transverse damper was used to provide a dipolar excitation, which blows up the transverse emittance due to band-limited, white noise excitation that is injected into the transverse damper feedback loop [34].
The value of \(\beta ^{*}\) in the IR1 and IR5 experimental insertion was 0.4 m. The primary collimators were set at \(\sim 9\,{\sigma _{\mathrm{nom}}}\), while the tertiary collimators were positioned at \(\ge 15\,{\sigma _{\mathrm{nom}}}\), which are significantly in excess than that defined by the horizontal and vertical primaries. The value of \({\sigma _{\mathrm{nom}}}\) is computed assuming the nominal value of the rms normalised emittance, namely \(\epsilon ^{*}_{\mathrm{nom}}=3.75\,{\upmu \mathrm{{m}}}\).
After removing large orbit bumps in the experimental insertions, the fractional tunes were re-corrected to (0.31, 0.32) and chromaticity was set to \(Q'_{x,y}=3.0\) units for both planes and beams. Linear coupling was trimmed down to a value of \(|C^{-}|\approx 0.001\), which is at the limit of the measurement resolution. Having established the baseline conditions for the study, DA measurements were first performed by aggressively blowing up the Beam 2 bunches using the transverse damper up to very large emittances \(\sim \,25\,{\upmu \mathrm{{m}}}\). Large dodecapole sources were introduced by powering the IR-\(b_{6}\) correctors left and right of the interaction point (IP) 1 and 5 uniformly to their maximum current. Then, the single bunch in Beam 1 was also blown up in horizontal and vertical planes, thus allowing DA measurements for both beams. Approximately 1 h of intensity data were recorded in this configuration. Finally the IR non-linear corrections for normal and skew sextupole and normal and skew octupole errors, which had been commissioned at the start of 2017, were collectively removed, and approximately 30 min of intensity data were recorded for this final configuration. Additional details regarding the experimental session and the LHC setup can be found in Ref. [35]. Note that the accuracy of the beam intensity measurement is at the level of \(10^{-3}\).
The reported experimental procedure has been carefully prepared to avoid as much as possible disruptive effects on the results. The intensity of each individual bunch was in the order 7–\(8\times 10^9\) protons (note that the nominal bunch intensity for LHC is \(1.15\times 10^{11}\)), to prevent any collective effect to impact the measurements. The large transverse emittance prevented any brightness-related effects. Synchrotron radiation damping times are in the order of 12 h and 24 h for longitudinal and transverse emittance, respectively, which implies that no impact is to be expected on the timescale of our measurements. Finally, the lifetime for residual gas scattering is estimated to be 100 h, hence completely negligible.
The summary plots from the experimental session are shown in Fig. 3, where the evolution of the relative strength of the non-linear correctors and the bunch intensity are visible. The two machine configurations are characterised by different levels of beam losses depending on the transverse emittances. A careful inspection of the summary plots for Beam 2 leads to the conclusion that the beam losses occur preferentially in the vertical plane. It is also worth stressing that the two beams are coupled by the single-aperture magnets in the experimental IRs, this is, e.g. the case of the non-linear correctors used in this experiment, whereas the remaining parts of the two rings are different, which implies that a different behaviour of Beam 1 and 2 for similar conditions of emittance blow-up should not come as a surprise.
Figure 4 shows the measured transverse profiles of the two beams after the emittance blow up at the beginning and at the end of the loss measurement reported in Fig. 3c (note that the following considerations hold true for all experimental configurations presented here). The profiles have been obtained by means of the synchrotron light monitor and the slight left–right asymmetry of the horizontal profile of Beam 2 is an artefact of the instrument and should be neglected [36]. The values of the \(\sigma \) of the two distributions are the same at the percent level and in general the two profiles match each other very well. The measurements are dominated by the noise whenever the transverse amplitude exceeds \(\approx 2.5~\sigma \), while below this value, the transverse profiles prove to be Gaussian. The initial Gaussian distribution and the final one, as obtained from numerical simulations (see Sect. 4), are also shown. It is clearly seen that the diffusion mechanism is acting on the initial distribution by changing only the tails beyond \(\approx 2.5~\sigma \). The typical losses observed are at the level of 1–2% of the bunch intensity, which agrees with the tail content of Gaussian beyond \(\approx 2.5~\sigma \). Therefore, these observations suggest that a Gaussian initial distribution is an appropriate choice, although the synchrotron radiation monitor does not provide any direct quantitative measurement of the actual tails of the beam distribution.
Note that in the rest of the paper the configuration in which all IR correctors are powered will be indicated as ‘with correctors’, while that with the dodecapolar corrector only as ‘no correctors’.
4 Modelling the experimental results with a diffusion equation
The experimental results presented in Sect. 3 have been analysed by means of a 1D FP equation (5) with a Nekhoroshev-like form of the diffusion coefficient as in Eq. (6) (parenthetically, the data from the measurement campaign performed at injection energy in the LHC have been re-analysed using the diffusive approach and the discussion of the results can be found in Ref. [37]). The model needs to find an efficient method to determine the three parameters. The first step is to constrain the model to agree with the measured intensity curve at the end of the experimental time window and this fixes \(\varepsilon \). As a second step, the FP equation has been used to reproduce the various experimental cases determining the values of \(\kappa \) and \(I_*\) by minimising the \(L^2\) norm of the difference between the solution of the FP equation with an absorbing boundary condition at \(I=I_{\mathrm{abs}}\) and the corresponding measured intensity curve. This provided pairs of values for \(\kappa _j, {I_*}_j\) for each experimental configuration. The main observation is that \(\kappa \) is only very mildly depending on the configuration, which is in agreement with the fact that it should be linked with the number of degrees of freedom of the system under consideration. Therefore, the average of \(\kappa _j\) has been used as an estimate of \(\kappa \) for all data sets. The third and last step has been the computation of the solution of the FP equation for the various configurations using the only remaining free parameter \(I_*\) to minimise the \(L^2\) as done for the second step. It is worth pointing out that to remove the noise affecting the beam intensity measurement, which is visible in the two rightmost plots of Fig. 3, the intensity data have been filtered by a 50-data moving average.
The procedure described above aims to point out the functional form of the beam losses produced by the Nekhoroshev’s diffusion coefficient as we tried to avoid using the model parameters to obtain the best agreement with each individual measurement, but rather to find a global agreement between the numerical solutions and the measurement results.
Since the action variable I represents the non-linear invariant of the system, we have chosen as initial condition an exponential distribution
where \(\sigma ^2\) stands for the measured beam emittance, to reproduce the measured beam profile as shown in Fig. 4. Moreover, by scaling the action variable \(I\rightarrow I/\sigma ^2\) we can set \(\sigma =1\) in the simulations without affecting the beam loss rate. Finally, the absorbing boundary condition \(I_{\mathrm{abs}}\) is computed from the position of the collimator expressed in units of beam emittance and considering the physical plane where one expects the beam diffusion to be more relevant.
In Table 1 we report the model’s parameters obtained by applying the procedure described above, i.e. from the numerical evaluation of the relative intensity losses at the absorbing barrier with FP (5). It is worth noting that for Beam 2 the case with H blow up and ‘with correctors’ features no appreciable beam losses and therefore, no attempt to derive model’s parameters has been made.
The values of the \(L^2\) norm for the final numerical results are also listed in Table 1. The norm provides a cumulative measure of the deviation between the measured and the simulated beam loss curves and the values are relative to the total beam loss measured for each configuration. The order of magnitude is around few percent, which corresponds to about few \(10^{-4}\) of the absolute intensity loss. Note that the precision with which the beam intensity is measured is below the percent level. Therefore, the overall agreement can be considered excellent.
Figure 5 shows the results of the numerical simulations together with the experimental data for the complete Beam 1 data set.
The agreement between the measured data and the simulations results is excellent as it is indicated by the values of the \(L^2\) norm in Table 1. Figure 6 shows the results of our analysis for the Beam 2 data set, in which the case with H blow up has been discarded due to insufficient level of beam losses.
Also in this case, the agreement between experimental observations and numerical simulations is striking. It is also worth noting that the time span of the various data sets covers a rather wide range of turn numbers and the agreement does not depend on the duration of the measurements.
According to the physical interpretation of the diffusive model parameters, the exponent \(\kappa \) plays a fundamental role in determining the shape of the beam loss curve. The second model’s parameter \(I_*\) defines a transition threshold in the action space from a fast to a slow diffusion and it changes the shape of the curve when its value is comparable with the position of the absorbing barrier.
To illustrate the sensitivity of the simulated beam intensity to the values of \(\kappa \) and \(I_*\) in Fig. 7 we compare the beam loss curves computed with the diffusion model when the two parameters are varied, one at a time, with respect to the optimal value top and centre plots. In the bottom plot the relative difference between the curve reproducing the experimental data and those with varied model parameters is shown.
The value of \(\kappa \), varied by few percent, influences substantially the shape of the initial part of the interpolating curve, right after the initial fast transient beam losses. This observation supports the assumption that the constancy of the exponent \(\kappa \) for the different considered cases can be attributed to an intrinsic property of the observed diffusion process.
In the perturbation theory, the \(I_*\) parameter is interpreted as a global scaling for the perturbative series related to the nature of the non-linear terms present in the system, although it is not directly linked to their magnitude. The effect of the value of \(I_*\) on the shape of the beam loss curve depends on the ratio \(I_{\mathrm{abs}}/I_*\), which provides the position of the absorbing barrier: a greater value of \(I_*\) reduces the beam halo and consequently the beam losses at the position of the absorbing barrier, whereas the opposite effect occurs for lower values of \(I_*\).
In summary, Fig. 7 indicates that the proposed approach is sensitive to changes in \(\kappa \) and \(I_*\) at the level of few percent, which provides a very strong support to the robustness of the proposed model against variation of its parameters. In turns, this means that the differences between the values obtained by the numerical simulations could reflect actual differences in the dynamics occurring in the weak chaotic regions where the diffusion phenomena take place.
5 Symplectic-tracking checks of the experimental observations
The analysis presented in this paper does not rely on anything else than the numerical solution of the FP equation. Nevertheless, some tracking simulations have been performed to assess the choice of the boundary conditions and the plane of losses as well as some of the assumptions needed for the diffusive approach to be a valid option.
The ring model is the most accurate description of the LHC lattice including the measured field errors (see [38] for more detail) together with the operational configuration of the various correction circuits. The numerical protocol used envisages the generation of sixty realisations of the magnetic errors to take into account the measurement uncertainties, moreover, a polar grid of initial conditions in x–y space is defined and its evolution is computed for up to \(10^6\) turns. The polar grid of initial conditions is obtained by dividing the first quadrant of the x–y space in 59 angles and along each direction 30 initial conditions are uniformly distributed over intervals of \(2\sigma \).
The evolution of the initial conditions through the LHC lattice is computed using the SixTrack code [39], which implements a second-order symplectic integration method. The loss time, i.e. the time an orbit associated with a given initial condition reaches a pre-defined amplitude, is recorded and associated to each initial condition. The outcome of these simulations is shown in Fig. 8, where the stable region is shown for Beam 1 (upper row) and Beam 2 (lower row) and for each of the two configurations used in the experiment (‘with correctors’ in the left column, ‘no correctors’ in the right one) for the first realisation of the magnetic errors.
The different colours are used to identify various stability times \(N_{\mathrm{stab}}\), i.e. dark-blue markers indicate particles with \(N_{\mathrm{stab}} < 10^5\) and the marker size is proportional to the value of \(N_{\mathrm{stab}}\). Yellow markers indicate a region for which \(N_{\mathrm{stab}} > 10^5\), while for green markers \(N_{\mathrm{stab}} > 10^6\). The shrinking of the extent of the stable region for increasing values of \(N_{\mathrm{stab}}\) is clearly visible. Moreover, the border of stability is almost circular for Beam 1, whereas it is much more irregular for Beam 2.
Figure 8 shows also three white curves: they represent the \(3\sigma \) level lines of the beam distribution for the three types of blow up applied during the experiment, namely H, V, or H–V. For Beam 1, the beam distribution is rather close to the stability boundary and sizeable losses are to be expected. For Beam 2, it is worth noting that the irregular shape of the stability border implies that the \(3\sigma \) edge of the beam distribution is relatively far from the border itself, which is in agreement with the low beam losses measured for the H blow up case. Both the H–V and V blow up cases feature the edge of the beam distribution close to the stability border in the vicinity of the y axis. This explains qualitatively the higher losses observed for these cases and, of course, also the fact that the case with H–V blow up generates even higher beam losses.
These plots, however, provide only static information about the extent of the stable region of phase space. The time dependence can be reconstructed by means of Eq. (1) and it is shown in Fig. 9, where the Beam 1 (upper row) and the Beam 2 (lower row) cases are shown. The configuration ‘with correctors’ is reported in the left column, whereas that with ‘no correctors’ in the right one.
Each plot features three sets of curves: one representing the DA averaged over all angles in the x–y space according to Eq. (1) (hence representing a situation relevant for the H–V blow-up case); one representing the DA averaged over 10 angles, only, in the vicinity of the x axis (hence representing a situation relevant for the H blow-up case), and the last one representing the DA averaged over ten angles, only, in the vicinity of the y axis (hence representing a situation relevant for the V blow-up case). Each set of curves includes data from all sixty realisations of the LHC lattices.
The Beam 1 case features a much smaller spread among the sixty realisations than the Beam 2 case. Apart from this, however, the results for the two beams share a number of common features: the curves representing the average close to the x axis have the mildest dependence on time; the curves representing the average close to the y axis and the global average have a very similar shape featuring almost a constant shift between them.
The numerical model can also provide quantitative information on other lattice properties, such as the detuning with amplitude. An important condition for the applicability of the approach based on the diffusion equation is that \(\partial \varOmega /\partial I\) is not too small for the cases under consideration. Indeed, the value of \(\partial \varOmega /\partial I\) has been evaluated and turned out to be O(1) as requested for the validity of Eq. (12).
In summary, the results of the symplectic-tracking simulations confirm that the assumptions needed to apply a diffusive approach to the description of beam losses in the LHC are fulfilled.
6 Conclusions
In this paper, a novel diffusive model capable of an excellent agreement with the experimental results of the recent dynamic aperture at the CERN LHC has been presented. The model is inspired by the optimal estimate of the remainder of the perturbative series provided by Nekhoroshev theorem, which we propose as functional form of the diffusion coefficient. The model features three parameters that characterise the diffusion equation and the physical meaning of these parameters has been highlighted and discussed in detail. The model has been successfully applied to the description of the beam loss data sets, which originated from measurements performed at the CERN LHC at 6.5 TeV. The various data sets represent a number of different configurations in terms of non-linear effects in the beam dynamics, which suggests that the excellent agreement obtained between measurements and the model is a generic feature. One of the model parameters, \(\kappa \), is kept the same for all data sets, which is perfectly in line with its physical interpretation. The deviation of the obtained value of \(\kappa \) from the theoretical estimate is being further investigated. Given that a 1D formalism provides a good description of the experimental measurements, as shown in the previous sections, the theoretical estimate would provide \(\kappa =1\). As a matter of fact, the theoretical estimates consider a local transport in the action, whereas we are applying the proposed model to a global action diffusion. Therefore, one could expect that in our case \(\kappa \) is the result of an effective description of the dynamics on a large spatial scale. Furthermore, it is worth stressing that the results of the numerical simulations depend critically on the values of the model’s parameters, i.e. variations at the level of few percent of these parameters change strongly the functional form of the simulated beam losses. This is a very reassuring observation, indicating that the determination of the model parameters is very robust.
In all the considered cases, a 1D approach has been applied and this assumption has been probed by means of symplectic-tracking simulations, which fully supported the choice made. The interesting question whether the diffusion model can be justified on the basis of tracking simulations requires a detailed study of the phase space structure to detect the existence of weakly chaotic regions and to understand the effect of external random perturbations unavoidable in real accelerators. We plan also to investigate an approach based on the 2D Fokker–Planck equation and to examine in depth the relation between the diffusion coefficient and the stability times for the orbits in phase space computed by means of the Nekhoroshev’s estimate [17].
It is also worth stressing that the proposed approach can be used to predict the beam losses as well as the evolution of the beam distribution, extrapolating experimental measurements to long time scales that are well beyond the possibility of present symplectic-tracking codes. The optimal approach should be to couple symplectic tracking with the Fokker–Planck equation: the first, performed over a limited number of turns, but with a detailed exploration of phase space, should provide the information about the functional form of the diffusion coefficient; the latter would provide information on the long-term dynamics, such as beam losses and transverse distributions, possibly covering realistic times scales for accelerator physics applications. Therefore, in this way one could apply the outlined approach as a real predictive technique for future high-energy colliders.
References
N.M. Gelfand, in Proceedings of SSC Workshop on Accelerator Physics Issues for a Superconducting Super Collider, ed. by M. Tigner. Calculations of the Dynamic Aperture at the Tevatron, UM-HE-84-1, p. 124 (1984)
V. Visnjic, in Proceedings of 1991 Particle Accelerator Conference, ed. J. Chew, L. Lizama. Dynamic Aperture of Low Beta Lattices at Tevatron Collider (IEEE Computer Society Press, Piscataway, 1991), p. 1701
V. Visnjic, inProceedings of Workshop On Nonlinear Problems in FutureParticle Accelerators, ed. by W. Scandale, G. Turchetti. Dynamic Aperture of the Future Tevatron Collider (World Scientific, Teaneck, 1991)
R. Brinkmann, F. Willeke, Proceedings of the 1988 European Particle Accelerator Conference, ed. by S. Tazzari. Persistent Current Field Errors and Dynamic Aperture of the Hera Proton Ring (World Scientific, Singapore, 1988), p. 911
F. Zimmermann, F. Willeke, Long term stability and dynamic aperture of the HERA proton ring, DESY-HERA-91-08 (1991)
F. Zimmermann, Dynamic aperture and transverse proton diffusion in HERA. AIP Conf. Proc. 326, 98 (1995)
O.S. Brüning, F. Schmidt, W. Fischer, F. Willeke, Comparison of measured and computed dynamic aperture for the SPS and the HERA proton ring. Part. Accel. 54, 223 (1996)
Y. Luo, M. Bai, J. Beebe-Wang, W. Fischer, A. Jain, C. Montag, T. Roser, S. Tepikian, D. Trobjevic, in Proceedings of 2007 Particle Accelerator Conference, ed. by C. Petit-Jean-Genaz. Dynamic Aperture Evaluation at the Current Working Point for RHIC Polarized Proton Operation (IEEE Computer Society Press, Piscataway, 2007), p. 4363
T. Sen, A.W Chao, Y. Yan, in Proceedings of 1991 Particle Accelerator Conference, ed. by L. Lizama, J. Chew. Effect of Tune Modulation on the Dynamic Aperture of the SSC Lattice (IEEE Service Center, Piscataway, 1991), p. 345
L. Schachinger, Y. Yan, Recent SSC Dynamic Aperture Measurements From Simulations, SSC-N-664 (1989)
O. Brüning, P. Collier, P. Lebrun, S. Myers, R. Ostojic, J. Poole, P. Proudlock (eds.), LHC Design Report, vol. 1. CERN-2004-003-V-1 (2004)
C.L. Siegel, J. Moser, Lectures in Celestial Mechanics (Springer, Berlin, 1971)
V. Arnold, Instability of dynamical systems with several degrees of freedom. Soviet Math. 5, 581 (1964)
N. Nekhoroshev, An exponential estimate of the time of stability of nearly-integrable Hamiltonian systems. Russ. Math. Surv. 32, 1 (1977)
E. Todesco, M. Giovannozzi, Dynamic aperture estimates and phase-space distortions in nonlinear betatron motion. Phys. Rev. E 53, 4067 (1996)
M. Giovannozzi, E. McIntosh, Development of parallel codes for the study of nonlinear beam dynamics. Int. J. Mod. Phys. C 8, 155 (1997)
M. Giovannozzi, W. Scandale, E. Todesco, Prediction of long-term stability in large hadron colliders. Part. Accel. 56, 195 (1996)
M. Giovannozzi, W. Scandale, E. Todesco, Dynamic aperture extrapolation in presence of tune modulation. Phys. Rev. E 57, 3432 (1998)
G. Turchetti, in Proceedings of Number Theory and Physics, vol. 47, ed. by J. M. Luck, P. Moussa, M. Waldschmidt. Nekhoroshev Stability Estimates for Symplectic Maps and Physical Applications, Springer Proceedings in Physics (Springer, Berlin, 1990), p. 223
A. Bazzani, S. Marmi, G. Turchetti, Nekhoroshev estimate for isochronous non resonant symplectic maps. Cel. Mech. 47, 333 (1990)
A. Bazzani, S. Siboni, G. Turchetti, Diffusion in Hamiltonian systems with a small stochastic perturbation. Phys. D 76, 8 (1994)
M. Giovannozzi, Proposed scaling law for intensity evolution in hadron storage rings based on dynamic aperture variation with time. Phys. Rev. ST Accel. Beams 15, 024001 (2012)
A. Morbidelli, A. Giorgilli, On a connection between KAM and Nekhoroshev’s theorems. Phys. D Nonlinear Phenom. 86–3, 514 (1995)
C. Froeschlé, E. Lega, Weak chaos and diffusion in hamiltonian systems. Dyn Small Bodies Solar Syst NATO ASI Ser Book Ser 522, 463 (1999)
High-Luminosity Large Hadron Collider (HL-LHC). in Technical Design Report V.0.1, vol. 4, ed. by G. Apollinari, I. Bejar Alonso, O. Brüning, P. Fessia, M. Lamont, L. Rossi, L. Tavian, CERN Yellow Reports: Monographs, CERN-2017-007-M (CERN, Geneva, 2017). https://doi.org/10.23731/CYRM-2017-004
E.H. Maclean, R. Tomás, F. Schmidt, T.H.B. Persson, Measurement of nonlinear observables in the Large Hadron Collider using kicked beams. Phys. Rev. ST Accel. Beams 17, 081002 (2014)
T. Collins, A.J. Dragt, R.P. Johnson, G. Lambertson, W. Scandale, R. Talman, F. Willeke, Proceedings of the 1984 Snowmass Workshop on Design and Utilization of the Superconducting Super Collider, ed. by R. Donaldson, J. G. Morfin, An Experimental Proposal to Use the Tevatron as an Aid to SSC Design by Testing Our Understanding of Effects Controlling Dynamic Aperture (1984), p. 361
J.M. Peterson, A.W. Chao, D. E. Johnson, S.G. Peggs, C.G. Saltmarsh, L.C. Schachinger, R.M. Talman, D.A. Edwards, D. Finley, N.M. Gelfand, R.E. Gerig, M. Harrison, R.P. Johnson, N. Merminga, M.J. Syphers, P.L. Morton, Proceedings of the 1988 European Particle Accelerator Conference, ed. by S. Tazzari, Dynamic Aperture Measurements at the Tevatron (World Scientific, Singapore, 1988), p. 266
W. Fischer, M. Giovannozzi, F. Schmidt, The Dynamic aperture experiment of the CERN SPS. Phys. Rev. E 55, 3507 (1997)
F. Willeke, Proceedings of 1995 Particle Accelerator Conference. Comparison of Measured and Calculated Dynamic Aperture (IEEE Service Center, Piscataway, 1996), p. 2747
M. Albert, G. Crockford, S. Fartoukh, M. Giovannozzi, E.H. Maclean, A. MacPherson, R. Miyamoto, L. Ponce, S. Redaelli, H. Renshall, F. Roncarolo, R. Steinhagen, E. Todesco, R. Tomás, W. Venturini Delsolaro, Proceedings of the Third International Particle Accelerator Conference, ed. by C. Eyberger, F. Zimmermann. First Experimental Observations from the LHC Dynamic Aperture Experiment (2012), p. 1362
M. Giovannozzi, S. Cettour Cave, R. De Maria, M. Ludwig, A. Macpherson, S. Redaelli, F. Roncarolo, M. Solfaroli Camillocci, W. Venturini Delsolaro, Proceedings of the Fourth International Particle Accelerator Conference, ed. by Z. Dai, C. Petit-Jean-Genaz, V.R.W. Schaa, C. Zhang. Experimental Observations from the LHC Dynamic Aperture Machine Development Study in 2012 (2013), p. 2606
E.H. Maclean, M. Giovannozzi, R.B. Appleby, An innovative method to measure the extent of the stable phase-space region of proton synchrotrons. Phys. Rev. Accel. Beams 22, 034002 (2019)
W. Höfle, R. Assmann, S. Redaelli, R. Schmidt, D. Valuch, D. Wollmann, M. Zerlauth, Proceedings of the Third International Particle Accelerator Conference, ed. by C. Eyberger, F. Zimmermann. Controlled Transverse Blow-up of High-Energy Proton Beams for Aperture Measurements and Loss Maps (2012), p. 4059
E.H. Maclean, F. Carlier, M. Giovannozzi, R. Tomás, Report from LHC MD 2171: dynamic aperture at 6.5 TeV, CERN-ACC-Note-2018-0054
G. Trad, Private Communication (2019)
A. Bazzani, O. Mazzarisi M. Giovannozzi, E.H. Maclean, in Nonlinear Dynamics and Collective Effects in Particle Beam, ed. by S. Chattopadhyay, M. Cornacchia, S. Di Mitri, Diffusion in Stochastically Perturbed Hamiltonian Systems with Applications to the Recent LHC Dynamic Aperture Experiments (World Scientific, 2019), p. 70
S. Fartoukh, M. Giovannozzi, Dynamic aperture computation for the as-built CERN Large Hadron Collider and impact of main dipoles sorting. Nucl. Instrum. Methods A 671, 10 (2012)
M.F. Mestre, A. Bazzani, P.M. Cincotta, C.M. Giordano, Stochastic approach to diffusion inside the chaotic layer of a resonance. Phys. Rev. E 89, 012911 (2014)
A. Bazzani, L. Beccaceci, L. Bigliardi, G. Turchetti, Fokker–Planck solutions for action diffusion in a noisy symplectic map. AIP Conf. Proc. 395, 109 (1997)
Acknowledgements
We would like to express our warm gratitude to the OP and BI groups of the CERN Beams department for the support lent to the experimental studies. One of the authors (AB) would like to thank the CERN Beams department for the hospitality during the work.
Author information
Authors and Affiliations
Corresponding author
Appendices
Appendix A: Constructing the diffusion equation
The existence of a small positive Ljapounov exponent allows to approximate the orbit’s diffusion by means of a stochastic process [40]. Under these conditions, let us consider a generic two-dimensional perturbed Hamiltonian
where \(\Vert H_1(\theta ,I)\Vert \) fulfils Nekhoroshev’s estimate in a region of phase space where the dynamics is weakly chaotic. The corresponding phase flow satisfies
where \(\varOmega =\partial H_0/\partial I\) is the phase advance and T represents a time scale sufficiently long with respect to the decorrelation time scale of chaotic orbits, i.e. T is related to the reciprocal of the smallest Ljapounov exponent. We introduce a small parameter \(\varepsilon =\Vert H_1\Vert \) as the \(L^2\)-norm of \(H_1\) and we define a new perturbation obtained by normalising the original one, namely \({{\hat{H}}}_1=H_1/\Vert H_1\Vert \). In the limit \(\varepsilon \ll 1\) the action dynamics is approximated by a diffusion process [40] at the action-diffusion time scale
whereas the angle variables tend to relax in a time scale of the order of [41]
In the analytic case, if \(\varepsilon \ll 1\) and \(\partial \varOmega /\partial I\) is bounded from below by a quantity O(1), we have \(T_\theta \ll T_I\) and the random-phase approximation is justified.
Then, the diffusion coefficient for the action dynamics is defined as
and the actions dynamics is approximated by the stochastic differential equation
where summation over repeated indexes has been assumed and \(dw_l(\tau ), \, 1 \le l \le n\) are n independent increments of Wiener processes and \(\tau =\Vert H_1\Vert ^2 t=\varepsilon ^2 t\) is the action-diffusion slow time. The physical meaning of the stochastic dynamics (14) is that the weakly chaotic dynamics can be approximated by a stochastic process at the action-diffusion time scale, where the different realisations depend on the initial conditions, and the action diffusion is so slow that one has a continuous relaxation of the angles variables [see Eq. (12)] justifying an averaging principle.
The evolution of a distribution function \(\rho _0(I)\) is described by a Fokker–Planck (FP) equation of the form
where \(\rho (I,0)=\rho _0(I)\) and boundary conditions of Eq. (15) are introduced as absorbing boundaries at \(I=I_{\mathrm{abs}}\). In this framework, the boundary conditions may represent both physical barriers at a given amplitude, beyond which particles are lost, as well as the frontier of the weakly chaotic layer where the diffusive approach is fully justified, beyond which fast escape to infinity occurs. It is worthwhile noting that a natural boundary condition at \(I=0\) exists, since in the considered model the following limit holds
In principle, to describe the diffusion behaviour of non-linear betatronic motion in a circular particle accelerator one needs two action variables \((I_x,I_y)\) that define the non-linear invariants. However, if the diffusion process takes place mainly along a one-dimensional direction, then an approach based on a one degree-of-freedom FP equation is well justified and Eq. (15) reduces to the simpler form
where
with an absorbing barrier at \(I=I_{\mathrm{abs}}\).
According to the Nekhoroshev-like estimate for the perturbation term in the Hamiltonian [cfr. Eq. (6) in the paper], we assume
where \(I_*\) depends on the non-linear terms and the exponent \(\kappa \) is related to the dimensionality of the system and c is a normalising constant according to Eq. (7). In the sequel we set \(\alpha =1/(2\kappa )\) to simplify the notation.
We are interested in the evolution of an initial distribution when \(I_{\mathrm{abs}}\ll I_*\). The main problem is to estimate the probability current at the location of the boundary condition and its dependence on the diffusion coefficient. The FP equation (17) can be associated to a stochastic differential equation of the form
Introducing the adimensional action \(u=I/I_*\) we have
The change of variable
reduces the stochastic differential equation (19) to the form
with an absorbing boundary condition at \(x=0\) and the drift coefficient given by
which represents an external forcing towards the boundary that vanishes when \(u\rightarrow 0\) (i.e. \(x\rightarrow -\infty \)) and it can be associated to the potential V(x).
In the next section we estimate the probability current lost at the absorbing barrier by studying the spectral properties of the Smoluchowski equation
when the drift coefficient can be approximated by a constant force and the diffusion coefficient \({\mathcal {D}}\) is constant.
Appendix B: Estimate of the probability current at an absorbing boundary for the Smoluchowski equation
Let us consider the Smoluchowski equation
it is possible to cast the diffusion operator in a self-adjoint form by introducing the distribution p(x, t) such that
which satisfies the self-adjoint diffusion equation
The solution of Eq. (26) can be expanded in the form
where
and \(\phi _\lambda (x)\) are the eigenfunctions of Eq. (26) that satisfy
where
Thanks to the orthogonality and completeness properties, which are expressed by
and using the initial conditions
one obtains that
Whenever \(\rho (x,0)=\delta (x-x_0)\) the solution reads
If we set an absorbing boundary condition at \(x=0\), i.e. \(\rho (0,t)=0\), then we require \(\phi _\lambda (0)=0\) and the probability current at the boundary reads
Let us consider the special case \(V(x)=-\nu x\), i.e. a constant drift towards the boundary with an absorbing condition at \(x=0\), then \(a(x)=\nu ^2/(4\, {\mathcal {D}}) \) and the self-adjoint operator
is negative defined, so that all eigenvalues satisfy \(\lambda \le 0\). Indeed, \({\mathcal {D}}\, \partial ^2/\partial x^2\) is also a negative defined symmetric operator and its spectrum (except for the zero eigenvalue) is bounded from above by \(-\nu ^2/(4\, {\mathcal {D}})\). Let \(-\lambda \) be the eigenvalue, the eigenvector equation reads
with the boundary condition \(\phi _\lambda (0)=0\). If we set
we have non-trivial solutions given by
The zero eigenvalue corresponds to a trivial solution and we have an upper limit for the negative eigenvalues from the condition
\(\lambda _{\mathrm{min}}^{-1}\) defines the decaying characteristic time for the Fiedler’s eigenvector. Therefore, the existence of a constant drift field implies an upper bound to the Fiedler’s eigenvalue.
As an example, we compute the probability current at the absorbing barrier \(x=0\) for the case \(\nu =0\) so that \(\phi _\lambda (x)\propto \sin \sqrt{\lambda /{\mathcal {D}}} \, x\). From Eq. (29) with \(\delta (x-x_0)\) as initial condition and for a continuous spectrum, we have
where \(\lambda =\mu ^2\, {\mathcal {D}}\). The integral reduces to a Gaussian integral since
and
Then we obtain
The presence of a drift field modifies the previous calculation and if we define \(\lambda -\nu ^2/(4\, {\mathcal {D}})=k_\lambda ^2=\mu ^2\, {\mathcal {D}}\) we have the relation
from which we get the Gaussian
and using the defintion (25) the probability current reads
In this case, i.e. for a \(\delta \)-function initial distribution and a linear drift field, the probability current is described by a Gaussian function moving towards the absorbing barrier and which is diffusing. In adiabatic regime it is possible to approximate the probability current for a slowly varying drift field according to
where \(\varPhi ^t(x_0)\) is the phase flow of the drift force, namely
For a generic initial distribution \(\rho _0(x)\) we have
Appendix C: Estimate of the probability current at an absorbing barrier with a Nekhoroshev-like diffusion coefficient
For a Nekhoroshev-like estimate of the diffusion coefficient, we perform the change of variable [cfr. Eq. (21)]
and we use the diffusion-scaled time \(\tau \rightarrow \tau \, c /I_*^2\). The drift field for the new FP equation is then given by
for an initial distribution centred at \(u_0\), where an average field approximation has been assumed. Then from Eq. (35) we get the expression for the outgoing probability current in the initial time
where
\(I_0\) and \(x_0(I_0)\) define the initial distribution \(\delta (I-I_0)\) in the action variable and the x variable, respectively [cfr. Eq. (21)]. The expression (40) underestimates the probability current at the absorbing boundary since the drift term (23) is a positive-increasing function if the initial condition satisfies \(I_0\ll I_*\). Indeed, a crude estimate gives
in the case of a Nekhoroshev-like diffusion coefficient. Moreover, we expect that the probability current decreases exponentially in the initial condition \(I_0\)
The time to lose a fixed probability fraction \(\varDelta P\) starting from an initial condition \(\delta (I-I_0)\) is computed by integrating the probability current, namely
where we neglect the drift-term contribution, which is justified by the exponential decaying of the drift field (39), and we use the auxiliary variable x defined as
Then we obtain a relationship between \(\varDelta P\) and \(x_{\varDelta P}\), i.e.
and we estimate the dynamic aperture timescale \(t_{\mathrm{DA}}\) from the definition (43), letting \(x_{\varDelta P} \simeq 1\) in Eq. (44)
Therefore \(t_{\mathrm{DA}}\) turns out to be exponentially long as we change the initial condition \(I_0\) in agreement with the tracking results [17, 18].
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Bazzani, A., Giovannozzi, M. & Maclean, E.H. Analysis of the non-linear beam dynamics at top energy for the CERN Large Hadron Collider by means of a diffusion model. Eur. Phys. J. Plus 135, 77 (2020). https://doi.org/10.1140/epjp/s13360-020-00123-2
Received:
Accepted:
Published:
DOI: https://doi.org/10.1140/epjp/s13360-020-00123-2