1 Introduction

1.1 Background and the Mean Field Limit Approach

Systems that involve many elements, be it a gas of particles or a herd of animals, are ubiquitous in our day to day lives. It is no wonder, then, that we are fascinated with their investigation and try to model and investigate the phenomena that define and evolve such systems.

Historically, we have three possible approaches to consider when dealing with such systems:

Microscopic approach in which we consider every element as an individual and find their tracjectories by solving a (most likely than not) coupled system of ODEs. This approach is the most accurate of the three, but also the most untenable due to the difficulty in solving such high number of coupled equations.

Macroscopic approach in which we “zoom” out, both in space and time, and investigate the resulting “fluid”. This method gets rid of statistically insignificant phenomena which we won’t see in the behaviour of the ensemble as a whole. The equations we consider in this case describe the evolution of the (physical) density of the resulting fluid.

Mesoscopic approach which combines the “best of both worlds” from the previous two approaches. The mesoscopic approach considers an average element of the system and how it evolves, trying to keep the microscopic picture while considering only statistically significant phenomena.

The mesoscopic approach was first introduced around the late 19th century during the golden age of the mathematical and physical investigation of the kinetic theory of gases. It has since outgrown its initial setting and is now used to describe a plethora of physical, chemical, biological and even societal and economical phenomena.

While nowadays we have many tools to solve mesoscopic equations, which are usually non-linear by nature, one of the main problems we encounter when dealing with these equations is the question of their relationship to the (more established) microscopic setting. A prime example to this issue, and what is now known as Hilbert’s 6th problem, is the question of whether or not one can show that the famous Boltzmann equation can be attained from the equations describing the motion of particles in a dilute gas. While a partial solution to this question was given in the 1975 work of Lanford [11], a result that was recently revisited in the work of Gallagher et al. [8], we are still lacking a full answer. The search for an answer to this problem, however, helped pave the way to a new and extremely potent idea—the idea of mean field limits.

In his 1956 work, [10], Mark Kac has suggested a different approach to tackle the issue of the validity of the Boltzmann equation. Kac has proposed to provide a probabilistic justification to it, instead of an exact derivation, by considering the evolution of an “average” model of a dilute gas that consists of N particles which undergo binary collisions.

Mathematically, Kac’s model (or Kac’s walk) is a jump process which describes the evolution of the probability density of an ensemble of particles. The symmetric probability density of the ensemble,Footnote 1\(F_N\), which is defined on \(\left( \mathbb {S}^{N-1}\left( \sqrt{N} \right) ,d\sigma _N \right) \) where \(d\sigma _N\) is the uniform probability measure on the \((N-1)\)-dimensional sphere of radius \(\sqrt{N}\), \(\mathbb {S}^{N-1}\left( \sqrt{N} \right) \), satisfies the so-called master equation

$$\begin{aligned} \partial _t F_N \left( \varvec{V}_N,t \right) =\mathcal {L}_N F_N \left( \varvec{V}_N ,t\right) = N(\mathcal {Q}-I)F_N\left( \varvec{V}_N ,t\right) , \end{aligned}$$

where \(\varvec{V}_N=\left( v_1,\dots ,v_N \right) \in \mathbb {S}^{N-1}\left( \sqrt{N} \right) \) and the collision operator, \(\mathcal {Q}\), is given by

$$\begin{aligned} \mathcal {Q}F\left( \varvec{V}_N \right) =\frac{1}{\left( \begin{array}{ll} N \\ 2 \end{array}\right) }\sum _{i<j}\frac{1}{2\pi } \int _{0}^{2\pi }F_N\left( R_{i,j,\theta }\left( \varvec{V}_N \right) \right) d\theta , \end{aligned}$$

with

$$\begin{aligned} \left( R_{i,j,\theta }\left( \varvec{V}_N \right) \right) _{l}={\left\{ \begin{array}{ll} v_l &{} l\not =i,j,\\ v_i\left( \theta \right) =v_{i}\cos \left( \theta \right) +v_{j}\sin \left( \theta \right) , &{} l=i,\\ v_j\left( \theta \right) =-v_{i}\sin \left( \theta \right) +v_{j}\cos \left( \theta \right) , &{} l=j. \end{array}\right. } \end{aligned}$$
(1)

Boltzmann’s equation, Kac’s surmised, should arise as a limit, in some sense, of the evolution equation for the first marginal of \(F_N\), \(F_{N,1}\), which represents the behaviour of an average particle in the system. A simple calculation shows that

$$\begin{aligned} \partial _t F_{N,1}(v) =\frac{1}{\pi } \int _{-\pi }^{\pi }\int _{\mathbb {R}} \left( F_{N,2}\left( v(\theta ),w(\theta ) \right) -F_{N,2}(v,w)\right) dwd\theta , \end{aligned}$$
(2)

where \(v\left( \theta \right) \) and \(w\left( \theta \right) \) are given by the same formula as that which defines \(v_i\left( \theta \right) \) and \(v_j\left( \theta \right) \) in (1). Equation (2) is not very surprising as we expect that the evolution of an average particle will be affected by its interaction with another particle, represented by the second marginal \(F_{N,2}\). Equation (2) is not closed, and if one attempts to find the equation for \(F_{N,2}\) they will find that it depends on the third marginal, \(F_{N,3}\). One can continue this way and find the so-called BBGKYFootnote 2 hierarchy, which ends with the original master equation.

At this point in his analysis Kac introduced a truly novel idea which was inspired by the original work of Boltzmann. Kac realised that the model we discussed above didn’t fully take into account the fact that the gas we are considering is dilute. The dilutness implies that we expect that any two given particles have very small chance to collide with one another and the more particles we have in the system—the smaller the chance is. Intuitively speaking, what we expect is that as N increases the particles become more and more independent. In other word, for any fixed \(k\in \mathbb {N}\) we have that the \(k-\)th marginal of \(F_N\), \(F_{N,k}\), which represents the behaviour of a group of k random particles, will become more tensorised with respect to a limiting function, f, which represents the limiting behaviour of one average particle:

$$\begin{aligned} {\begin{matrix} &{}F_{N,1}(v_1)\underset{N\text { large}}{\approx } f(v_1),\\ &{}F_{N,2}(v_1,v_2)\underset{N\text { large}}{\approx } f(v_1)f(v_2), \\ &{}\vdots \\ &{}F_{N,k}(\varvec{V}_k)\underset{N\text { large}}{\approx } f^{\otimes k}\left( \varvec{V}_k \right) . \end{matrix}} \end{aligned}$$

Kac has defined the above property, which we now call (molecular) chaos or chaoticity, rigorously. This new notion provided Kac with the “closure condition” needed to take a limit in (2). Kac has shown that his model remains chaotic if it starts as such, which is known as propagation of chaos, and that the generating function for the evolved probability density satisfies the famous Boltzmann-Kac equation

$$\begin{aligned} \partial _t f(v) = \frac{1}{\pi }\int _{-\pi }^{\pi }\int _{\mathbb {R}} \left( f(v(\theta ))f(w(\theta ))-f(v_1)f(v_2) \right) dwd\theta , \end{aligned}$$

in the limit when N goes to infinity. While Kac’s original model only considered the case where the velocities of the particles in the ensemble are assumed to be one dimensional, the above has been extended to higher dimensions and more realistic models where the resulting mean field equation is precisely the Boltzmann equation (see, for instance, [12]).

Kac’s model and approach have had ramification beyond their immediate success—ushering the field of mean field models and limits. We notice that his procedure relied on exactly two ingredients:

  • An average model for a system of interacting elements. In our context this is an evolution equation for the probability density of the ensemble of elements.Footnote 3

  • An asymptotic correlation relation. This relation expresses the emerging phenomena we expect to get as the number of elements goes to infinity. For Kac’s model this relation was chaoticity.

The simplicity of the above approach, sometimes called the mean field limit approach, opened the flood gate to the investigation of various many element models which, in recent decades, permeated into the realms of biology, chemical interactions and even sociology—with examples which include swarming of animals, neural networks, and consensus amongst people (see [1, 2, 4] as well as the review paper [7] and references within).

It may come as a surprise that while the mean field limit approach is used in various settings, the only asymptotic correlation used to this day is that of chaoticity. This, however, doesn’t seem appropriate in many biological and societal situations where we expect more dependence than independence between the underlying elements. This suspicion has been confirmed in recent works of Carlen et al. [5, 6] who have constructed an animal based model which, after appropriate scaling, deviates from chaoticity. The need for a different type of asymptotic correlation is the beginning of this work.

1.2 Chaos, Order, and Choose the Leader Model

We start this subsection by describing the Choose the Leader model, or CL model in short, introduced in the works of Carlen et al. [5, 6]. This model will motivate our definition of a new asymptotic correlation relation—order.

The CL model is, similarly to Kac’s model, a velocity based pair-interaction jump process that describes the evolution of a system that revolves around a herd of animals or a biological swarm.

The model consists of N animals who move in a planar domain. The velocity of each individual is assumed to be of magnitude 1 and as such can be considered to be an element of the circle \(\mathbb {S}^1\). At a random time, given by a Poisson stream with a rate \(\lambda >0\), a pair of animals is chosen at random uniformly amongst all the animals and undergoes a “collision”: one of the animals, again chosen at random uniformly between the two, adapts its velocity to the second animal up to a small amount of “noise”. Mathematically, this means that if the i-th and j-th animal interacted and the j-th animal decided to follow the i-th animal, we have that post collision

$$\begin{aligned} \left( v_i,v_j \right) \longrightarrow \left( v_i,Zv_i \right) , \end{aligned}$$

where Z is an independent random variable with values on \(\mathbb {S}^1\) and a given density function g, and where we have used the notation vw to indicate the velocity \(e^{i\left( \text {Arg}(v)+\text {Arg}(w) \right) }\), considering elements in \(\mathbb {S}^1\) to be of the form \(e^{i\theta }\).

Following on the above convention on \(\mathbb {S}^1\) we can replace the velocity variables with their respective angle on the circle and conclude that the state space of the model is the N-dimensional torus, \(\mathcal {T}^N=\left[ -\pi ,\pi \right] ^N\) (with the appropriate identification of the end points of the intervals), and that the master equation of the above process, i.e. the equation for the probability density of the ensemble on \(\mathcal {T}^N\) with respect to the underlying probability measure \(\frac{d\theta _1\dots d\theta _N}{\left( 2\pi \right) ^N}\), is given by

$$\begin{aligned} {\begin{matrix} \partial _t F_N\left( \theta _1,\dots ,\theta _N \right) &{} = \frac{2\lambda }{N-1}\sum _{i<j}\Bigg \{\frac{g\left( \theta _i-\theta _j \right) }{2}\Bigg ( \left[ F_N \right] _{\widetilde{j}}\left( \theta _1,\dots ,\widetilde{\theta }_{j},\dots ,\theta _N \right) \\ &{}\quad +\left[ F_N \right] _{\widetilde{i}}\left( \theta _1,\dots ,\widetilde{\theta }_{i},\dots ,\theta _N \right) \Bigg ) -F_N\left( \theta _1,\dots ,\theta _N \right) \Bigg \}. \end{matrix}} \end{aligned}$$
(3)

with

$$\begin{aligned} \left[ F_N \right] _{\widetilde{j}}\left( \theta _1,\dots ,\widetilde{\theta }_{j},\dots ,\theta _N \right) =\int _{-\pi }^{\pi }F_N\left( \theta _1,\dots ,\theta _N \right) \frac{d\theta _j}{2\pi }. \end{aligned}$$

where we have used the notation \(\left( \theta _1,\dots ,\widetilde{\theta }_{j},\dots ,\theta _N \right) \) for the \((N-1)\)-dimensional vector which is attained by removing \(\theta _j\) from the original N-dimensional vector \(\left( \theta _1,\dots ,\theta _N \right) \). We will continue and use this notation throughout this paper.

From the description of the CL model it seems that as times passes more meetings between the animals of the herd will happen and consequently greater overall mutual adherence will be observed. The emergence of these correlation, however, may strongly depend on the number of animals. Indeed, the more animals we have the less likely it is that any two given animals will meet—increasing the time we’ll have to wait before we see any emerging pattern.

In their two papers [5, 6] Carlen et al. have addressed this issue. They showed that chaos does propagate on every fixed time interval, but is broken when we rescale our time variable as well as the noise intensity g. While seemingly odd, we shouldn’t be surprised that the deviation from the adherence of the velocities may also depend on the number of the animals when we think of biological/societal settings—it can be, for example, that the more animals we have, the more anxious they get and consequently they align themselves more closely when they meet.

This intuitive idea of adherence motivates our upcoming definition of order (Definition 5) but before we move to it, and for the sake of completeness, we remind the reader the general definition of chaoticity:

Definition 1

Let \(\mathcal {X}\) be a Polish space. We say that a sequence of symmetric probability measures, \(\mu _N\in \mathscr {P}\left( \mathcal {X}^N \right) \) with \(N\in \mathbb {N}\), is \(\mu _0-\)chaotic for some probability measure \(\mu _0\in \mathscr {P}\left( \mathcal {X} \right) \) if for any \(k\in \mathbb {N}\)

$$\begin{aligned} \Pi _k\mu _{N} \underset{N\rightarrow \infty }{\overset{\text {weak}}{\longrightarrow }}\mu _0^{\otimes k} \end{aligned}$$

where \(\Pi _k \mu _N\) is the \(k-\)the marginal of \(\mu _N\). The weak convergence in the above refers to convergence when integrating against bounded continuous functions.

It is worth to mention at this point that there are various notions of chaoticity. We refer the interested reader to [9] for more information.

Carlen, Degond and Wennberg have shown the propagation of chaos in general pair-interaction models in [6]. In particular they have proved the following:

Theorem 2

Assume that \(\left\{ F_{N}(0)\right\} _{N\in \mathbb {N}}\) is f-chaotic. Then for any \(t>0\) the solution to the CL master equation (3) with initial datum \(\left\{ F_{N}(0)\right\} _{N\in \mathbb {N}}\), \(\left\{ F_{N}(t)\right\} _{N\in \mathbb {N}}\), is f(t)-chaotic. Moreover, \(f\left( t \right) \) satisfies the equation

$$\begin{aligned} \partial _t f\left( \theta ,t \right) = \left( g*f \right) \left( \theta ,t \right) - f\left( \theta ,t \right) . \end{aligned}$$

As was mentioned before, the breaking of chaoticity is achieved by rescaling the time and intensity of the interaction in (3). The time would naturally be rescaled by a factor of N to guarantee that in a (rescaled) unit time all pairs of animals have interacted once. The scaling of the interaction, on the other hand, is motivated from a standard scaling on the line—restricted to \(\left[ -\pi ,\pi \right] \):

Definition 3

Given a symmetric probability density on \(\mathbb {R}\) with respect to the Lebesgue measure dx, g, and a scaling parameter \(\epsilon >0\) we define the rescaled and restricted probability density on \(\mathcal {T}\) with respect to the underlying probability measure \(\frac{d\theta }{2\pi }\), \(g_\epsilon \), by

$$\begin{aligned} g_\epsilon \left( \theta \right) = \frac{1}{\epsilon \widetilde{g}_{\epsilon }}g\left( \frac{\theta }{\epsilon } \right) \end{aligned}$$

where

$$\begin{aligned} \widetilde{g}_{\epsilon } =\frac{1}{2\pi }\int ^{\frac{\pi }{\epsilon }}_{-\frac{\pi }{\epsilon }}g(x)dx. \end{aligned}$$

We will assume from this point onwards that the probability density of our interaction in the CL model is of the form described above and that its “generator”, g, is a symmetric probability density with at least a finite third moment.

To simplify the presentation of what is to follow we will write \(f\in \mathscr {P}\left( \mathcal {X},\mu \right) \) when f is a probability density on \(\mathcal {X}\) with respect to the underlying measure \(\mu \). We will shorten the above notation and say that \(f\in \mathscr {P}\left( \mathcal {X} \right) \) when \(\mu \) is clear from the setting. In the remainder of our work we will consider the spaces \(\mathcal {T}^k\) with the inherent measure \(\frac{d\theta _1\dots d\theta _k}{\left( 2\pi \right) ^k}\), where \(k\in \mathbb {N}\).

Following the time rescaling \(t^\prime =\frac{t}{N}\) (which we will still denote as t) and allowing the interaction scaling parameter to depend on N, i.e. considering \(\epsilon =\epsilon _N\) in Definition 3, we attain the general rescaled CL master equation:

$$\begin{aligned} {\begin{matrix} \partial _t F_N\left( \theta _1,\dots ,\theta _N \right) =&{} \frac{2\lambda N}{N-1}\sum _{i<j}\Bigg \{\frac{g_{\epsilon _N}\left( \theta _i-\theta _j \right) }{2}\Bigg ( \left[ F_N \right] _{\widetilde{j}}\left( \theta _1,\dots ,\widetilde{\theta }_{j},\dots ,\theta _N \right) \\ &{}+\left[ F_N \right] _{\widetilde{i}}\left( \theta _1,\dots ,\widetilde{\theta }_{i},\dots ,\theta _N \right) \Bigg ) -F_N\left( \theta _1,\dots ,\theta _N \right) \Bigg \}. \end{matrix}} \end{aligned}$$
(4)

Carlen, Chatelin, Degond, and Wennberg have shown the following in [5]:

Theorem 4

Consider the rescaled CL master Eq. (4) with \(\epsilon _N=\frac{1}{\sqrt{N}}\) and let \(\left\{ F_{N}(t)\right\} _{N\in \mathbb {N}}\) be the family of their solutions. If \(\left\{ F_{N,k}\left( t \right) \right\} _{N\in \mathbb {N}}\) converges weakly to a family \(\left\{ f_k(t)\right\} _{k\in \mathbb {N}}\) when N goes to infinity for any \(k\in \mathbb {N}\) and \(t>0\) then \(\left\{ f_k(t)\right\} _{k\in \mathbb {N}}\) is not chaotic, i.e. \(f_k(t) \not = f_1^{\otimes k}(t)\) for \(k\ge 2\).

From the construction of the model and the discussion above we are not too surprised by this result—asymptotic independence is not what we expect when the animals try to adhere to one another. What we do expect, in a sense, is that if we allow the correlation to reach their full potential then the entire herd moves in a single direction following a random leader. This motivates the following new definition:

Definition 5

Let \(\mathcal {X}\) be a Polish space. We say that a sequence of symmetric probability measures, \(\mu _N\in \mathscr {P}\left( \mathcal {X}^N \right) \) with \(N\in \mathbb {N}\), is \(\mu _0-\)ordered for some probability measure \(\mu _0\in \mathscr {P}\left( \mathcal {X} \right) \) if for any \(k\in \mathbb {N}\)

$$\begin{aligned} \Pi _k\left( d\mu _{N} \right) \left( \theta _1,\dots ,\theta _k \right) \underset{N\rightarrow \infty }{\overset{\text {weak}}{\longrightarrow }}d\mu _0\left( \theta _1 \right) \prod _{i=1}^{k-1} \delta _{\theta _i}\left( \theta _{i+1} \right) \end{aligned}$$
(5)

where \(\delta _{a}\left( \cdot \right) \) is the delta measure concentrated at the point a. When \(\mu _0\) has a density function f with respect to an underlying measure on \(\mathcal {X}\), \(\mu \) (i.e. when \(d\mu _0\left( \theta \right) = f\left( \theta \right) d\mu \left( \theta \right) \)), we will say that the sequence \(\left\{ \mu _N\right\} _{N\in \mathbb {N}}\) is \(f-\)ordered and simplify (5) by writing

$$\begin{aligned} \Pi _k\left( d\mu _{N} \right) \left( \theta _1,\dots ,\theta _k \right) \underset{N\rightarrow \infty }{\overset{\text {weak}}{\longrightarrow }}f\left( \theta _1 \right) \prod _{i=1}^{k-1} \delta _{\theta _i}\left( \theta _{i+1} \right) . \end{aligned}$$

Remark 6

Since

$$\begin{aligned} \prod _{i=1}^{k-1} \delta _{\theta _i}\left( \theta _{i+1} \right) =\prod _{i=2}^{k} \delta _{\theta _1}\left( \theta _i \right) \end{aligned}$$

we can reformulate Definition 5 by requiring that

$$\begin{aligned} \Pi _k\left( d\mu _{N} \right) \left( \theta _1,\dots ,\theta _k \right) \underset{N\rightarrow \infty }{\overset{\text {weak}}{\longrightarrow }}d\mu _0\left( \theta _1 \right) \prod _{i=2}^{k} \delta _{\theta _1}\left( \theta _i \right) . \end{aligned}$$

This formalisation of order highlights a bit more the concentration of the limit of \(\Pi _k\left( d\mu _{N} \right) \) on the diagonal. Additionally, if \(\mathcal {X}\) also has a group operation, which we will denote by \(+\), we can rewrite (5) as

$$\begin{aligned} \Pi _k\left( d\mu _{N} \right) \left( \theta _1,\dots ,\theta _k \right) \underset{N\rightarrow \infty }{\overset{\text {weak}}{\longrightarrow }}d\mu _0\left( \theta _1 \right) \prod _{i=1}^{k-1} \delta \left( \theta _{i+1}-\theta _i \right) \end{aligned}$$
(6)

where \(\delta \) is the delta measure concentrated at 0. This is the case in our setting where \(\mathcal {X}=\mathcal {T}\) with the underlying measure \(\frac{d\theta }{2\pi }\) and we will use this notation from this point onwards.

Remark 7

As we are starting to mix between singular measures and probability densities we may encounter notational issues. To simplify the presentation of this work, we will keep using a density based notation with the understanding that

$$\begin{aligned} \int _{\mathcal {T}}h(\theta )\delta \left( \theta -\varphi \right) \frac{d\theta }{2\pi } = h\left( \varphi \right) \end{aligned}$$

for all appropriate measurable functions.

Much like when considering the notion of chaoticity, an immediate question one must ask is whether or not there are any ordered states. The answer to that is in the affirmative. Given a Polish space \(\mathcal {X}\) and \(\mu _0\in \mathscr {P}\left( \mathcal {X} \right) \) we can define the family

$$\begin{aligned} d\mu _N \left( \theta _1,\dots ,\theta _N \right) =d \mu _0\left( \theta _1 \right) \prod _{i=1}^{N-1} \delta _{\theta _i}\left( \theta _{i+1} \right) \in \mathscr {P}\left( \mathcal {X}^N \right) \end{aligned}$$

whose marginals clearly satisfy

$$\begin{aligned} \Pi _k\left( d\mu _N \right) \left( \theta _1,\dots ,\theta _k \right) = d\mu _0\left( \theta _1 \right) \prod _{i=1}^{k-1} \delta _{\theta _i}\left( \theta _{i+1} \right) . \end{aligned}$$

This shouldn’t come as a great surprise: since our notion or order speaks of an asymptotic concentration along the diagonal, choosing a family that already has this property produces an ordered state (this is, in a sense, equivalent to choosing a tensorised family of states in the chaotic setting).

It is worth to note that since

$$\begin{aligned} d\mu _0\left( \theta _1 \right) \prod _{i=1}^{N-1} \delta _{\theta _i}\left( \theta _{i+1} \right) =\frac{1}{N}\sum _{j=1}^Nd\mu _0\left( \theta _j \right) \prod _{i+1\not =j} \delta _{\theta _i}\left( \theta _{i+1} \right) \end{aligned}$$

our family \(\left\{ \mu _N\right\} _{N\in \mathbb {N}}\) is indeed symmetric.

Our goal in this work is to explore the newly defined notion of order and show that it is the right asymptotic correlation relation for the rescaled CL model, at least when the interaction is strong enough. Moreover, we will show that this notion propagates.

1.3 Main Results

As we’ve mentioned in the previous subsection, in order to see an emergence of a non-chaotic phenomenon we need to rescale the time and the intensity of the underlying interactions in the process. While the works of Carlen et al. discuss a specific choice of scaling intensity \(\epsilon _N\), we have, in fact, three different possibilities.

To see these possibilities more clearly, let us consider the evolution equation for the first marginal. A simple integration of (4) together with the fact that for symmetric density functions

$$\begin{aligned} \left[ F_N \right] _{\widetilde{j}}\left( \theta _1,\dots ,\widetilde{\theta }_{j},\dots ,\theta _N \right) =F_{N,N-1}\left( \theta _1,\dots ,\widetilde{\theta }_{j},\dots ,\theta _N \right) \end{aligned}$$

shows that the evolution of the k-th marginals, with \(k=1,\dots ,N\), is given by the following BBGKY hierarchy

$$\begin{aligned} {\begin{matrix} &{}\partial _t F_{N,k}\left( \theta _1,\dots ,\theta _k \right) \\ &{}\quad =\frac{2\lambda N}{N-1}\sum _{i<j\le k}\Bigg \{\frac{g_{\epsilon _N}\left( \theta _i-\theta _j \right) }{2}\Bigg ( F_{N,k-1}\left( \theta _1,\dots ,\widetilde{\theta }_{j},\dots ,\theta _k \right) \\ &{}\qquad +F_{N,k-1}\left( \theta _1,\dots ,\widetilde{\theta }_{i},\dots ,\theta _k \right) \Bigg ) -F_{N,k}\left( \theta _1,\dots ,\theta _k \right) \Bigg \}\\ &{}\qquad +\frac{2\lambda N\left( N-k \right) }{N-1}\sum _{i\le k}\frac{1}{2}\Bigg \{\int _{\mathcal {T}}g_{\epsilon _N}\left( \theta _i-\theta _{k+1} \right) F_{N,k}\left( \theta _1,\dots ,\widetilde{\theta }_{i},\dots ,\theta _{k+1} \right) \frac{d\theta _{k+1}}{2\pi }\\ &{}\qquad -F_{N,k}\left( \theta _1,\dots ,\theta _k \right) \Bigg \} \end{matrix}} \end{aligned}$$
(7)

(for proof in the non-scaled case, see [5]). When \(k=1\) the above reads as

$$\begin{aligned} \begin{aligned}&\partial _t F_{N,1}\left( \theta _1,t \right) =\lambda N \left( \int _{-\pi }^{\pi }g_{\epsilon _N}\left( \theta _1-\theta \right) F_{N,1}\left( \theta ,t \right) \frac{d\theta }{2\pi } -F_{N,1}\left( \theta _1,t \right) \right) . \end{aligned} \end{aligned}$$
(8)

As our underlying space is \(\mathcal {T}=[-\pi ,\pi ]\) and the above is clearly a PDE which involves convolution, we are motivated to use Fourier analysis and see that on the Fourier side equation (8) can be rewritten as

$$\begin{aligned} \frac{d}{dt}\widehat{F_{N,1}}\left( n,t \right) = \lambda N \left( \widehat{g_{\epsilon _N}}\left( n \right) -1 \right) \widehat{F_{N,1}}\left( n,t \right) ,\qquad n\in \mathbb {Z}. \end{aligned}$$
(9)

The solution to (9) is explicitly given by

$$\begin{aligned} \widehat{F_{N,1}}\left( n,t \right) = e^{\lambda N \left( \widehat{g_{\epsilon _N}}\left( n \right) -1 \right) t}\widehat{F_{N,1}}\left( n,0 \right) ,\qquad n\in \mathbb {Z}. \end{aligned}$$
(10)

It can be shown that as long as g has a finite third moment

$$\begin{aligned} \widehat{g_{\epsilon _N}}\left( n \right) = 1 +\frac{m_2}{2}\epsilon _N^2n^2 + O\left( \epsilon _N^3\left|n\right|^3 \right) , \end{aligned}$$
(11)

where \(m_2= \int _{\mathbb {R}}x^2 g(x)dx\) which implies that

$$\begin{aligned} \widehat{F_{N,1}}\left( n,t \right) = e^{-\lambda \left( \frac{m_2}{2}\left( N\epsilon _N^2 \right) n^2+ O\left( N\epsilon _N^3 n^3 \right) \right) t}\widehat{F_{N,1}}\left( n,0 \right) ,\qquad n\in \mathbb {Z}. \end{aligned}$$

The above gives rise to three scaling options:

  1. (i)

    \(\underline{N\epsilon _N^2 \underset{N\rightarrow \infty }{\longrightarrow }0.}\) In this case the interaction scaling is more dominant than the time scaling. This is the case where we expect correlation to form quickly and that order will emerge.

  2. (ii)

    \(\underline{N\epsilon _N^2=1.}\) This is the case discussed in [5, 6]. The scaled interaction and time are “balanced” in a diffusive manner.Footnote 4 Interestingly, in this case order, as defined in Definition 5, is not observed as we will show shortly. As a small remark we’d like to mention that we could have replaced the condition \(N\epsilon _N^2=1\) with \(N\epsilon _N^2 \underset{N\rightarrow \infty }{\longrightarrow }C\) where \(0<C<\infty \).

  3. (iii)

    \(\underline{N\epsilon _N^2 \underset{N\rightarrow \infty }{\longrightarrow }\infty .}\) In this case the time scaling is more dominant than the interaction scaling and as a result we don’t expect correlation to form quickly enough. We expect that chaos will prevail here.

Our main results in this work concern themselves only with the first two cases as our goal is to veer away from chaoticity. Before we state our main theorems we’d like to note that the existence and uniqueness of solutions to (3) (and equivalently (4)) is immediate from the form of the evolution equation(s) and the fact that the operators which govern them are linear and bounded.Footnote 5

Theorem 8

Let \(\left\{ F_N(t)\right\} _{N\in \mathbb {N}}\) be the family of symmetric solutions to (4). Assume in addition that \(\lim _{N\rightarrow \infty }N\epsilon _N^2=0\) and that \(\left\{ F_{N,k}\left( 0 \right) \right\} _{N\in \mathbb {N}}\) converges weakly as N goes to infinity to a family \(f_k \in \mathscr {P}\left( \mathcal {T}^k \right) \) for any \(k\in \mathbb {N}\).

Then for any \(t>0\) and any \(k\in \mathbb {N}\), \(\left\{ F_{N,k}(t)\right\} _{N\in \mathbb {N}}\) converges weakly as N goes to infinity to a family \(f_k(t)\in \mathscr {P}\left( \mathcal {T}^k \right) \) which satisfies

$$\begin{aligned} \begin{aligned}&f_k\left( \theta _1,\dots ,\theta _k,t \right) \\&\quad = e^{-\lambda k\left( k-1 \right) t}f_k\left( \theta _1,\dots ,\theta _k \right) \\&\qquad +2\lambda \int _{0}^t e^{-\lambda k\left( k-1 \right) \left( t-s \right) }\left( \sum _{i<j\le k}f_{k-1}\left( \theta _1,\dots , \widetilde{\theta _i},\dots ,\theta _k,s \right) \delta \left( \theta _i-\theta _j \right) \right) ds. \end{aligned} \end{aligned}$$
(12)

In particular, we have that

$$\begin{aligned} \lim _{t\rightarrow \infty }f_{k}(\theta _1,\dots , \theta _k,t) = f_1\left( \theta _1 \right) \prod _{j=1}^{k-1}\delta \left( \theta _{i+1}-\theta _i \right) \end{aligned}$$

which is an \(f_1-\)ordered family. Moreover, if \(\left\{ F_N(0)\right\} _{N\in \mathbb {N}}\) is \(f_1-\)ordered then

$$\begin{aligned} f_k\left( \theta _1,\dots ,\theta _k,t \right) =f_1\left( \theta _1 \right) \prod _{j=1}^{k-1}\delta \left( \theta _{i+1}-\theta _i \right) \end{aligned}$$

for all \(t>0\).

Remark 9

The family of measures given by (12) is indeed a family of probability measures. To see that we notice that

$$\begin{aligned} \int _{\mathcal {T}^k}f_k\left( \theta _1,\dots ,\theta _k,t \right) \frac{d\theta _1\dots d\theta _k}{\left( 2\pi \right) ^k}=e^{-\lambda k\left( k-1 \right) t}\int _{\mathcal {T}^k}f_k\left( \theta _1,\dots ,\theta _k \right) \frac{d\theta _1\dots d\theta _k}{\left( 2\pi \right) ^k} \end{aligned}$$
$$\begin{aligned} +2\lambda \int _{0}^{t} e^{-\lambda k\left( k-1 \right) \left( t-s \right) }\left( \sum _{i<j\le k}\int _{\mathcal {T}^{k-1}}f_{k-1}\left( \theta _1,\dots , \widetilde{\theta _i},\dots ,\theta _k,s \right) \frac{d\theta _1\dots d\widetilde{\theta _{i}}\dots d\theta _k}{\left( 2\pi \right) ^{k-1}} \right) ds. \end{aligned}$$

Assuming by induction that \(f_{k-1}\left( t \right) \) is a probability measure shows that

$$\begin{aligned} \int _{\mathcal {T}^k}f_k\left( \theta _1,\dots ,\theta _k,t \right) \frac{d\theta _1\dots d\theta _k}{\left( 2\pi \right) ^k}=e^{-\lambda k\left( k-1 \right) t}+\lambda k\left( k-1 \right) \int _{0}^{t} e^{-\lambda k\left( k-1 \right) \left( t-s \right) }ds=1, \end{aligned}$$

where we used the fact that \(2\sum _{i<j\le k}1=k(k-1)\).

Remark 10

The first result of Theorem 8 tells us that no matter which weakly converging family we start with, the limit family will become ordered as time goes to infinity. We can think about this as generation of order. It is interesting to note that a phenomena of generation of chaos was also observed by Lukkarinen. More information can be found in [13].

We would also like to point out that the second result in the Theorem 8 describes the propagation of order in the CL model as it states that for any \(k\in \mathbb {N}\) and \(t>0\)

$$\begin{aligned} \lim _{N\rightarrow \infty }F_{N,k}(t) = f_1\left( \theta _1 \right) \prod _{j=1}^{k-1}\delta \left( \theta _{i+1}-\theta _i \right) . \end{aligned}$$

This capitalises on the fact that the interaction scaling is stronger then the time scaling, which is enough to imply a time independent ordered state for all \(t>0\).

Theorem 11

Let \(\left\{ F_N(t)\right\} _{N\in \mathbb {N}}\) be the family of symmetric solutions to (4). Assume in addition that \(N\epsilon _N^2=1\). Then \(\left\{ F_N(t)\right\} _{N\in \mathbb {N}} \) is neither chaotic nor ordered for any \(t>0\).

Following on Theorem 8 we might wonder if the lack of order in this setting is resolved when we allow time to go to infinity. While the next theorem answers this question in the negative, it does show that there is hope for some sort of partial order (in terms of relative concentration on the diagonal) to appear. We will discuss this a bit more in §5.

Theorem 12

Let \(\left\{ F_N(t)\right\} _{N\in \mathbb {N}}\) be the family of symmetric solutions to (4). Assume in addition that \(N\epsilon _N^2=1\) and that \(\left\{ F_{N,1}(0)\right\} _{N\in \mathbb {N}}\) and \(\left\{ F_{N,2}(0)\right\} _{N\in \mathbb {N}}\) converge weakly to \(f_1\in \mathscr {P}\left( \mathcal {T} \right) \) and \(f_2\in \mathscr {P}\left( \mathcal {T}^2 \right) \) respectively. Then for all \(t>0\) \(\left\{ F_{N,1}(t)\right\} _{N\in \mathbb {N}}\) and \(\left\{ F_{N,2}(t)\right\} _{N\in \mathbb {N}}\) converge to \(f_1(t)\in \mathscr {P}\left( \mathcal {T} \right) \) and \(f_2(t)\in \mathscr {P}\left( \mathcal {T}^2 \right) \) respectively which satisfy

$$\begin{aligned} \lim _{t\rightarrow \infty } f_1(\theta _1,t) =1, \end{aligned}$$
(13)

and

$$\begin{aligned} \lim _{t\rightarrow \infty }f_2(\theta _1,\theta _2,t) = \mathcal {H}\left( \theta _1-\theta _2 \right) \end{aligned}$$
(14)

where

$$\begin{aligned} \mathcal {H}\left( \theta \right) = \sum _{n\in \mathbb {Z}}\frac{2}{m_2 n^2+2}e^{in \theta }=1+4\sum _{n\in \mathbb {N}}\frac{\cos \left( n\theta \right) }{m_2 n^2+2}. \end{aligned}$$

Remark 13

While it is possible to find \(f_1(t)\) and \(f_2(t) \) (as we will see in the proof of the theorem), the focus of Theorem 12 is on the asymptotic behaviour with respect to time and consequently we elected to exclude formulae from the statement.

Remark 14

As can be seen in the figure below

Fig. 1
figure 1

A plot of an approximation of \(\mathcal {H}\) with \(m_2=1\) by the first 500 terms of the cosine series

\(\mathcal {H}\) is somewhat concentrated around 0, validating our intuition that some “type of order” (or partial order) phenomenon may emerge here.

1.4 The Organisation of the Paper

In Sect. 2 we will discuss some preliminaries that will help us prove our main results. Section 3 will be dedicated to the proof of Theorem 8 while Sect. 4 will focus on Theorems 11 and 12. We’ll conclude the work with some final remarks in Sect. 5 and an appendix which considers some technical details.

2 Preliminaries

Looking at the BBGKY hierarchy of our (rescaled) CL model, given by (7), we immediately notice that besides the fact that we are dealing with a closed linear hierarchy—it also involves a simple convolution term. This motivates us to use Fourier analysis in our investigation of the model, the application of which will be the focus of this short section.

In this section we will consider the following topics: the connection between weak convergence and Fourier coefficients on \(\mathcal {T}^k\) and the meaning of order in the Fourier space, the behaviour of the Fourier coefficients of \(g_{\epsilon _N}\), and the recasting of our rescaled master equation (4) in the Fourier space.

To simplify notations we will denote by \(g_N=g_{\epsilon _N}\) from this point onwards.

We start with the following simple observation, which is presented without proof.

Lemma 15

Let \(\left\{ \mu ^{(k)}_{N}\right\} _{N\in \mathbb {N}}\) be a sequence of probability measures on \(\mathcal {T}^k,\) and let \(\mu ^{(k)}\in \mathscr {P}\left( \mathcal {T}^k \right) \). Then \(\mu ^{(k)}_N\underset{N\rightarrow \infty }{\overset{\text {weak}}{\longrightarrow }}\mu ^{(k)}\) if and only if for any \(\left( n_1,\dots ,n_k \right) \in \mathbb {Z}^k\)

$$\begin{aligned} \begin{aligned} \widehat{\mu _{N}^{(k)}}\left( n_1,\dots ,n_k \right)&= \int _{\mathcal {T}^k} e^{-i\sum _{j=1}^k n_j \theta _j} d\mu _N^{(k)}\left( \theta _1,\dots ,\theta _k \right)&\underset{N\rightarrow \infty }{\longrightarrow }\widehat{\mu ^{(k)}}\left( n_1,\dots ,n_k \right) . \end{aligned} \end{aligned}$$

We would like to remind the reader that when we consider a probability density \(f_k\in \mathscr {P}\left( \mathcal {T}^k \right) \) it is always with respect to the underlying measure \(\frac{d\theta _1\dots d\theta _k}{\left( 2\pi \right) ^k}\) which means that

$$\begin{aligned} \widehat{f_k}\left( n_1,\dots ,n_k \right) =\int _{\mathcal {T}^k} f_k\left( \theta _1,\dots ,\theta _k \right) e^{-i\sum _{j=1}^k n_j \theta _j} \frac{d\theta _1\dots d\theta _k}{\left( 2\pi \right) ^k}, \end{aligned}$$

as expected.

2.1 The Meaning of Order in the Fourier Space

Following on Lemma 15 we want to find out how an ordered state looks like in the Fourier space:

Lemma 16

The family \(F_N\in \mathscr {P}\left( \mathcal {T}^N \right) \), with \(N\in \mathbb {N}\), is \(f-\)ordered if and only if for any \(\left( n_1,\dots ,n_k \right) \in \mathbb {Z}^k\)

$$\begin{aligned} \begin{aligned} \widehat{F_{N,k}}\left( n_1,\dots ,n_k \right)&\underset{N\rightarrow \infty }{\longrightarrow }\widehat{f}\left( \sum _{j=1}^k n_j \right) . \end{aligned} \end{aligned}$$

The proof of the above relies on the following simple observation:

Lemma 17

Let \(f\in \mathscr {P}\left( \mathcal {T} \right) \) and let \(k\in \mathbb {N}\). Then \(\mu \in \mathscr {P}\left( \mathcal {T}^k \right) \) satisfies

$$\begin{aligned} d\mu \left( \theta _1,\dots ,\theta _k \right) =f\left( \theta _1 \right) \prod _{i=1}^{k-1}\delta \left( \theta _{i+1}-\theta _i \right) \end{aligned}$$

if and only if \(\widehat{\mu }\left( n_1,\dots ,n_k \right) =\widehat{f}\left( \sum _{j=1}^k n_j \right) \) for any \(\left( n_1,\dots ,n_k \right) \in \mathbb {Z}^k\).

Proof

Since the Fourier coefficients of a measure determine it uniquely,

it is enough for us to show that the Fourier coefficient of \(\mu _{\text {o}}=f\left( \theta _1 \right) \prod _{i=1}^{k-1}\delta \left( \theta _{i+1}-\theta _i \right) \) at \(\left( n_1,\dots ,n_k \right) \) is \(\widehat{f}\left( \sum _{j=1}^{k}n_j \right) \). Indeed

$$\begin{aligned} \widehat{\mu _\text {o}}\left( n_1,\dots ,n_k \right) = \int _{\mathcal {T}^k}f\left( \theta _1 \right) \prod _{i=1}^{k-1}\delta \left( \theta _{i+1}-\theta _i \right) e^{-i\sum _{j=1}^k n_j\theta _j}\frac{d\theta _1\dots d\theta _k}{\left( 2\pi \right) ^k} \end{aligned}$$
$$\begin{aligned} =\int _{\mathcal {T}}f\left( \theta _1 \right) e^{-i\left( \sum _{j=1}^k n_j \right) \theta _1}\frac{d\theta _1}{2\pi } = \widehat{f}\left( \sum _{j=1}^k n_j \right) . \end{aligned}$$

\(\square \)

Proof of Lemma 16

The proof is an immediate application of Lemmas 15 and 17.

2.2 The Behaviour of the Fourier Coefficients of \(g_N\)

The penultimate ingredient we need in our investigation of (4) and to show the appearance of order is the following lemma:

Lemma 18

Let \(g\in \mathscr {P}\left( \mathbb {R},dx \right) \) be such that its k-th moment, defined as

$$\begin{aligned} m_k = \int _{\mathbb {R}}\left|x\right|^k g(x)dx, \end{aligned}$$

is finite for some \(k>2\). Then for any \(\epsilon <\frac{\pi }{\root k \of {m_k}}\) and any \(n\in \mathbb {Z}\)

$$\begin{aligned} \left|\widehat{g_\epsilon }(n) - 1 + \frac{m_2}{2}\left( n\epsilon \right) ^2\right| \le \frac{2\epsilon ^k m_k}{\pi ^k-\epsilon ^km_k} + \frac{m_3}{3}\left( \left|n\right|\epsilon \right) ^3. \end{aligned}$$
(15)

The proof of the above is fairly straightforward and can be found in Appendix A for the sake of completion.

Remark 19

We would like to point out that the approximation (11) follows immediately from the above.Footnote 6

2.3 Recasting of the (Rescaled) Master Equation for the CL Model in the Fourier Space

The last result of this section concerns itself with recasting (4) with the Fourier coefficients of our given family of solutions. We would like to mention that as the underlying space is compact and the generator of our master equation is a bounded linear operator, there is no issue with interchanging the time derivative and spatial integration which we will perform in order to move to the Fourier space.

Lemma 20

Let \(\left\{ F_{N,k}(t)\right\} _{N\in \mathbb {N}}\) be the family of k-th marginals to the family of symmetric solutions to (4), \(\left\{ F_N(t)\right\} _{N\in \mathbb {N}}\). Then we have that

$$\begin{aligned} \begin{aligned}&\partial _t \widehat{F_{N,k}}\left( n_1,\dots ,n_k \right) \\&\quad = \frac{2\lambda N}{\left( N-1 \right) }\sum _{i<j\le k}\Bigg \{ \frac{\widehat{g_N}\left( n_i \right) +\widehat{g_N}\left( n_j \right) }{2}\widehat{F_{N,k-1}}\left( n_1,\dots , n_i+n_j,\dots ,n_k \right) \\&\qquad - \widehat{F_{N,k}}\left( n_1,\dots ,n_k \right) \Big \}+\frac{\lambda N\left( N-k \right) }{\left( N-1 \right) }\widehat{F_{N,k}}\left( n_1,\dots ,n_k \right) \sum _{i\le k}\left( \widehat{g_N}\left( n_i \right) -1 \right) , \end{aligned} \end{aligned}$$
(16)

where \(\left( n_1,\dots , n_i+n_j,\dots ,n_k \right) \) is attained by replacing \(n_i\) with \(n_i+n_j\) and omitting \(n_j\) from the original vector \(\left( n_1,\dots ,n_k \right) \) or, due to the symmetry of \(\widehat{F_{N,k-1}}\), replacing \(n_j\) with \(n_i+n_j\) and omitting \(n_i\). Identity (16) can also be rewritten as

$$\begin{aligned} \begin{aligned}&\partial _t \widehat{F_{N,k}}\left( n_1,\dots ,n_k \right) \\&\quad = \frac{\lambda N}{N-1}\left( \left( N-k \right) \sum _{i\le k}\left( \widehat{g_N}\left( n_i \right) -1 \right) -k\left( k-1 \right) \right) \widehat{F_{N,k}}\left( n_1,\dots ,n_k \right) \\&\qquad +\frac{\lambda N}{N-1}\sum _{i<j\le k} \left( \widehat{g_N}\left( n_i \right) +\widehat{g_N}\left( n_j \right) \right) \widehat{F_{N,k-1}}\left( n_1,\dots , n_i+n_j,\dots ,n_k \right) . \end{aligned} \end{aligned}$$
(17)

Proof

We start by noticing that due to the symmetry of g we find that for any \(i<j \le k\)

$$\begin{aligned} \int _{\mathcal {T}^k}g_{N}\left( \theta _i-\theta _j \right) F_{N,k-1}\left( \theta _1,\dots ,\widetilde{\theta }_{j},\dots ,\theta _k \right) e^{-i\sum _{l=1}^k n_l \theta _l}\frac{d\theta _1\dots d\theta _k}{\left( 2\pi \right) ^k} \end{aligned}$$
$$\begin{aligned} =\int _{\mathcal {T}^k}g_{N}\left( \theta _j-\theta _i \right) F_{N,k-1}\left( \theta _1,\dots ,\widetilde{\theta }_{j},\dots ,\theta _k \right) e^{-i\sum _{l=1}^k n_l \theta _l}\frac{d\theta _1\dots d\theta _k}{\left( 2\pi \right) ^k} \end{aligned}$$
$$\begin{aligned} =\widehat{g_N}\left( n_j \right) \int _{\mathcal {T}_{k-1}}F_{N,k-1}\left( \theta _1,\dots ,\widetilde{\theta }_{j},\dots ,\theta _k \right) e^{-i\sum _{l\not =j,\;l=1}^k n_l \theta _l}e^{-in_j\theta _i}\frac{d\theta _1\dots d\widetilde{\theta }_j\dots d\theta _k}{\left( 2\pi \right) ^{k-1}} \end{aligned}$$
$$\begin{aligned} =\widehat{g_N}\left( n_j \right) \widehat{F_{N,k-1}}\left( n_1,\dots , \underbrace{n_i+n_j}_{i\text {-th position}},\dots ,\widetilde{n_j},\dots , n_k \right) . \end{aligned}$$

Similarly

$$\begin{aligned} \int _{\mathcal {T}_k}g_{N}\left( \theta _i-\theta _j \right) F_{N,k-1}\left( \theta _1,\dots ,\widetilde{\theta }_{i},\dots ,\theta _k \right) e^{-i\sum _{l=1}^k n_l \theta _l}\frac{d\theta _1\dots d\theta _k}{\left( 2\pi \right) ^k} \end{aligned}$$
$$\begin{aligned} =\widehat{g_N}\left( n_i \right) \widehat{F_{N,k-1}}\left( n_1,\dots ,\widetilde{n_i},\dots ,\underbrace{n_i+n_j}_{j\text {-th position}},\dots ,n_k \right) . \end{aligned}$$

The above implies that

$$\begin{aligned} \begin{aligned} \mathcal {F}_{\mathcal {T}^k}\Bigg (&\sum _{i<j\le k}\Bigg \{\frac{g_{N}\left( \cdot _i-\cdot _j \right) }{2} \Big ( F_{N,k-1}\left( \cdot _1,\dots ,\widetilde{\cdot }_{j},\dots ,\cdot _k \right) \\&\qquad +F_{N,k-1}\left( \cdot _1,\dots ,\widetilde{\cdot }_{i},\dots ,\cdot _k \right) \Big ) -F_{N,k}\left( \cdot _1,\dots ,\cdot _k \right) \Bigg \}\Bigg )\left( n_1,\dots ,n_k \right) \\&\quad = \sum _{i<j\le k}\Bigg \{ \frac{\widehat{g_N}\left( n_j \right) }{2}\widehat{F_{N,k-1}}\left( n_1,\dots , \underbrace{n_i+n_j}_{i\text {-th position}},\dots ,n_k \right) \\&\qquad +\frac{\widehat{g_N}\left( n_i \right) }{2}\widehat{F_{N,k-1}}\left( n_1,\dots ,\underbrace{n_i+n_j}_{j\text {-th position}},\dots ,n_k \right) \\&\qquad - \widehat{F_{N,k}}\left( n_1,\dots ,n_k \right) \Big \}. \end{aligned} \end{aligned}$$
(18)

where we used the notation of \(\mathcal {F}_{\mathcal {T}^l}\left( f \right) \left( n_1,\dots ,n_l \right) =\widehat{f}\left( n_1,\dots ,n_l \right) \) when \(f\in \mathscr {P}\left( \mathcal {T}^l \right) \). Next, due to the symmetry of \(F_N\), we see that for any \(i\le k\)

$$\begin{aligned} \begin{aligned}&\int _{\mathcal {T}^k}\left( \int _{\mathcal {T}}g_{N}\left( \theta _i-\theta _{k+1} \right) F_{N,k}\left( \theta _1,\dots ,\widetilde{\theta }_{i},\dots ,\theta _{k+1} \right) \frac{d\theta _{k+1}}{2\pi } \right) e^{-i\sum _{l=1}^k n_l \theta _l}\frac{d\theta _1\dots d\theta _k}{\left( 2\pi \right) ^k}\\&\quad = \widehat{g_N}\left( n_i \right) \int _{\mathcal {T}_k} F_{N,k}\left( \theta _1,\dots ,\widetilde{\theta }_{i},\dots ,\theta _{k+1} \right) e^{-i\sum _{l\not =i,\;l=1}^k n_l \theta _l}e^{-in_i\theta _{k+1}}\frac{d\theta _1\dots d\widetilde{\theta }_i\dots d\theta _{k+1}}{\left( 2\pi \right) ^k}\\&\quad =\widehat{g_N}\left( n_i \right) \widehat{F_{N,k}}\left( n_1,\dots ,n_k \right) , \end{aligned} \end{aligned}$$

and consequently

$$\begin{aligned} \begin{aligned}&\mathcal {F}_{\mathcal {T}^k}\Bigg ( \sum _{i\le k}\Bigg \{\int _{-\pi }^{\pi }g_{N}\left( \cdot _i-\theta _{k+1} \right) F_{N,k}\left( \cdot _1,\dots ,\widetilde{\cdot }_{i},\dots ,\theta _{k+1} \right) \frac{d\theta _{k+1}}{2\pi } \\&\quad -F_{N,k}\left( \cdot _1,\dots ,\cdot _k \right) \Bigg \}\left( n_1,\dots ,n_k \right) =\widehat{F_{N,k}}\left( n_1,\dots ,n_k \right) \sum _{i\le k}\left( \widehat{g_N}\left( n_i \right) -1 \right) . \end{aligned} \end{aligned}$$
(19)

Combining (18) and (19) with the BBGKY hierarchy (7) yields

$$\begin{aligned} \begin{aligned}&\partial _t \widehat{F_{N,k}}\left( n_1,\dots ,n_k \right) = \frac{2\lambda N}{N-1}\sum _{i<j\le k}\Bigg \{ \frac{\widehat{g_N}\left( n_j \right) }{2}\widehat{F_{N,k-1}}\left( n_1,\dots , \underbrace{n_i+n_j}_{i\text {-th position}},\dots ,n_k \right) \\&\quad +\frac{\widehat{g_N}\left( n_i \right) }{2}\widehat{F_{N,k-1}}\left( n_1,\dots ,\underbrace{n_i+n_j}_{j\text {-th position}},\dots ,n_k \right) - \widehat{F_{N,k}}\left( n_1,\dots ,n_k \right) \Big \}.\\&\quad +\frac{\lambda N\left( N-k \right) }{N-1}\widehat{F_{N,k}}\left( n_1,\dots ,n_k \right) \sum _{i\le k}\left( \widehat{g_N}\left( n_i \right) -1 \right) . \end{aligned} \end{aligned}$$

Since the fact that f is symmetric implies that so is \(\widehat{f}\) we conclude (16).

To attain (17) we notice that

$$\begin{aligned} 2\sum _{i<j \le k} 1= k(k-1) \end{aligned}$$

and rearrange (16).

\(\square \)

An immediate corollary of the above is the following:

Corollary 21

A recursive formula for the k-th marginals \(\left\{ F_{N,k}\right\} _{N\in \mathbb {N}}\) is given by

$$\begin{aligned} \begin{aligned}&\widehat{F_{N,k}}\left( n_1,\dots ,n_k,t \right) = e^{-\frac{\lambda N}{N-1}\left( \left( N-k \right) \sum _{l\le k}\left( 1-\widehat{g_N}\left( n_l \right) \right) +k\left( k-1 \right) \right) t}\widehat{F_{N,k}}\left( n_1,\dots ,n_k,0 \right) \\&\quad +\frac{\lambda N}{N-1}\sum _{i<j\le k}\left( \widehat{g_N}\left( n_i \right) +\widehat{g_N}\left( n_j \right) \right) \int _{0}^t e^{-\frac{\lambda N}{N-1}\left( \left( N-k \right) \sum _{l\le k}\left( 1-\widehat{g_N}\left( n_l \right) \right) +k\left( k-1 \right) \right) \left( t-s \right) } \\&\qquad \qquad \qquad \qquad \qquad \qquad \qquad \qquad \widehat{F_{N,k-1}}\left( n_1,\dots , n_i+n_j,\dots ,n_k,s \right) ds. \end{aligned} \end{aligned}$$
(20)

Consequently

$$\begin{aligned} \begin{aligned}&\widehat{F_{N,2}}\left( n_1,n_2,t \right) \\&\quad =e^{-\frac{\lambda N}{N-1}\left( \left( N-2 \right) \sum _{l\le 2}\left( 1-\widehat{g_N}\left( n_l \right) \right) +2 \right) t}\widehat{F_{N,2}}\left( n_1,n_2,0 \right) + \left( \widehat{g_N}\left( n_1 \right) +\widehat{g_N}\left( n_2 \right) \right) \\&\frac{e^{-\frac{\lambda N}{N-1}\left( \left( N-2 \right) \sum _{l\le 2}\left( 1-\widehat{g_N}\left( n_l \right) \right) +2 \right) t}- e^{-\lambda N\left( 1-\widehat{g_N}\left( n_1+n_2 \right) \right) t}}{\left( N-1 \right) \left( 1-\widehat{g_N}\left( n_1+n_2 \right) \right) -\left( N-2 \right) \sum _{l\le 2}\left( 1-\widehat{g_N}\left( n_l \right) \right) -2}\widehat{F_{N,1}}\left( n_1+n_2,0 \right) , \end{aligned} \end{aligned}$$
(21)

where we define \(\frac{e^{\alpha t}-e^{\beta t}}{\alpha -\beta }\) to be \(te^{\alpha t}\) if \(\alpha =\beta \).

Proof

(20) is a simple ODE solution to (17). Plugging the solution for the case \(k=1\) (which is given by (10)) in the identity for \(k=2\) gives

$$\begin{aligned} \begin{aligned}&\widehat{F_{N,2}}\left( n_1,n_2,t \right) = e^{-\frac{\lambda N}{N-1}\left( \left( N-2 \right) \sum _{l\le 2}\left( 1-\widehat{g_N}\left( n_l \right) \right) +2 \right) t}\widehat{F_{N,2}}\left( n_1,n_2,0 \right) + \\&\frac{\lambda N}{N-1}\left( \widehat{g_N}\left( n_1 \right) +\widehat{g_N}\left( n_2 \right) \right) \int _{0}^t e^{-\frac{\lambda N}{N-1}\left( \left( N-2 \right) \sum _{l\le 2}\left( 1-\widehat{g_N}\left( n_l \right) \right) +2 \right) \left( t-s \right) } \widehat{F_{N,1}}\left( n_1+n_2,s \right) ds \\&\quad =e^{-\frac{\lambda N}{N-1}\left( \left( N-2 \right) \sum _{l\le 2}\left( 1-\widehat{g_N}\left( n_l \right) \right) +2 \right) t}\widehat{F_{N,2}}\left( n_1,n_2,0 \right) +\frac{\lambda N}{N-1}\left( \widehat{g_N}\left( n_1 \right) +\widehat{g_N}\left( n_2 \right) \right) \\&\left( \int _{0}^t e^{-\frac{\lambda N}{N-1}\left( \left( N-2 \right) \sum _{l\le 2}\left( 1-\widehat{g_N}\left( n_l \right) \right) +2 \right) \left( t-s \right) } e^{\lambda N \left( \widehat{g_{N}}\left( n_1+n_2 \right) -1 \right) s}ds \right) \widehat{F_{N,1}}\left( n_1+n_2,0 \right) . \end{aligned} \end{aligned}$$

Using the fact that

$$\begin{aligned} \int _{0}^{t}e^{\alpha \left( t-s \right) }e^{\beta s}ds = \frac{e^{\alpha t}-e^{\beta t}}{\alpha -\beta }, \end{aligned}$$
(22)

with the convention that was mentioned in the statement of the corollary, we conclude (21). \(\square \)

With this in hand, we are ready to show our main theorems.

3 The Case of Strong Interactions

In this section we will show the emergence of order, and its propagation, in the case of strong interaction in the CL model. We start by noticing that Corollary 21 in Sect. 2 gives us an inkling to why Theorem 8 holds. Indeed, under the assumption that \(\lim _{N\rightarrow \infty }N\epsilon _N^2=0\) we can show that

$$\begin{aligned} \lim _{N\rightarrow \infty } N\left( 1-\widehat{g_N}(n) \right) =0 \end{aligned}$$

for any fixed n and consequently, using (21), we see that as long as \(F_{N,1}(0)\) and \(F_{N,2}(0)\) converge weakly to \(f_1\) and \(f_2\) respectively we have that

$$\begin{aligned} \widehat{f_2}\left( n_1,n_2,t \right) =\lim _{N\rightarrow \infty }\widehat{F_{N,2}}\left( n_1,n_2,t \right) = e^{-2\lambda t} \widehat{f}_2\left( n_1,n_2 \right) + \left( 1-e^{-2\lambda t} \right) \widehat{f_1}\left( n_1+n_2 \right) \end{aligned}$$
$$\begin{aligned} =\mathcal {F}_{\mathcal {T}^2}\left( e^{-2\lambda t}f_2\left( \cdot _1,\cdot _2 \right) +\left( 1-e^{-2\lambda t} \right) f_1\left( \cdot _1 \right) \delta \left( \cdot _2-\cdot _1 \right) \right) \left( n_1,n_2 \right) . \end{aligned}$$

In other words

$$\begin{aligned} f_2\left( \theta _1,\theta _2,t \right) =e^{-2t}f_2\left( \theta _1,\theta _2 \right) + \left( 1-e^{-2t} \right) f_1\left( \theta _1 \right) \delta \left( \theta _2-\theta _1 \right) \end{aligned}$$

which fits the statements of Theorem 8. Let us show the proof in the general case:

Proof of Theorem 8

Using Lemma 18, we find that

$$\begin{aligned} \left|\widehat{g_N}(n) -1 - \frac{m_2}{2}\epsilon _N^2 n^2\right| \le C \epsilon _N^3 \left|n\right|^3 \end{aligned}$$

for all \(n\in \mathbb {Z}\). Thus, if \(\lim _{N\rightarrow \infty }N\epsilon _N^2=0\) we have that

$$\begin{aligned} 0\le N\left( 1-\widehat{g_N}(n) \right) \le \left( \frac{m_2}{2}n^2+C \epsilon _N \left|n\right|^3 \right) N\epsilon _N^2, \end{aligned}$$

where we used the fact that the Fourier coefficient of any real and symmetric probability density is always real and bounded in absolute value by 1.

We conclude from the above that for any \(n\in \mathbb {Z}\) we have that

$$\begin{aligned} \lim _{N\rightarrow \infty } N\left( 1-\widehat{g_N}(n) \right) =0. \end{aligned}$$

Next, we recall that Lemma 15 assures us that for any \(\left( n_1,\dots ,n_k \right) \in \mathbb {Z}^k\) we have that

$$\begin{aligned} \lim _{N\rightarrow \infty }\widehat{F_{N,k}(0)}\left( n_1,\dots ,n_k \right) =\widehat{f_k}\left( n_1,\dots ,n_k \right) . \end{aligned}$$

Moreover, since the Fourier coefficients of any probability measure are bounded uniformly by 1, we can apply the Dominated Convergence Theorem to our recursive formula, (20), and conclude that for any \(t>0\) and any \(k\in \mathbb {N}\), \(\lim _{N\rightarrow \infty }\widehat{F_{N,k}}\left( n_1,\dots ,n_k,t \right) =\widehat{f_{k}}\left( n_1,\dots ,n_k,t \right) \) exists and satisfiesFootnote 7

$$\begin{aligned} \begin{aligned} \widehat{f_{k}}\left( n_1,\dots ,n_k,t \right)&= e^{-\lambda k\left( k-1 \right) t}\widehat{f_k}\left( n_1,\dots ,n_k \right) \\&\quad +2\lambda \sum _{i<j\le k}\int _{0}^t e^{-\lambda k\left( k-1 \right) \left( t-s \right) } \widehat{f_{k-1}}\left( n_1,\dots , n_i+n_j,\dots ,n_k,s \right) ds, \end{aligned} \end{aligned}$$
(23)

where we have used the fact that \(\lim _{N\rightarrow \infty }\widehat{g_N}(n)=1\) for any \(n\in \mathbb {Z}\). This shows (12) due to the uniqueness of the Fourier coefficients and the fact that

$$\begin{aligned} \int _{\mathcal {T}^k} f_{k-1}\left( \theta _1,\dots ,\widetilde{\theta _{i}},\dots , \theta _k \right) \delta \left( \theta _i-\theta _j \right) e^{-i \sum _{l=1}^k n_l\theta _l}\frac{d\theta _1\dots d\theta _k}{\left( 2\pi \right) ^k} \end{aligned}$$
$$\begin{aligned} =\int _{\mathcal {T}^k} f_{k-1}\left( \theta _1,\dots ,\widetilde{\theta _{i}},\dots , \theta _k \right) e^{-i \sum _{l\not =i,\;l=1}^k n_l\theta _l}e^{-in_i\theta _j}\frac{d\theta _1\dots d\widetilde{\theta _i}\dots d\theta _k}{\left( 2\pi \right) ^{k-1}} \end{aligned}$$
$$\begin{aligned} =\widehat{f_{k-1}}\left( n_1,\dots ,n_i+n_j,\dots , n_k \right) . \end{aligned}$$

To show the convergence to an \(f_1\)-ordered state as time goes to infinity we notice that, just like the Lemma 15 and by utilising Lemma 16, it is enough for us to show that

$$\begin{aligned} \lim _{t\rightarrow \infty }\widehat{f_k}\left( n_1,\dots ,n_k,t \right) = \widehat{f_1}\left( \sum _{j=1}^k n_j \right) . \end{aligned}$$

We will achieve this by showing that for any \(k\ge 2\) there exists an explicit constant \(c_k\) which depends only on k such that

$$\begin{aligned} \left|\widehat{f_k}\left( n_1,\dots ,n_k,t \right) -\widehat{f_1}\left( \sum _{j=1}^k n_j \right) \right|\le c_k e^{-2\lambda t}. \end{aligned}$$

We start by noticing that for \(k=1\) (23) implies that

$$\begin{aligned} \widehat{f_1}(n,t)=\widehat{f_1}(n). \end{aligned}$$

Consequently, for \(k=2\) we have that

$$\begin{aligned} \begin{aligned}&\widehat{f_{2}}\left( n_1,n_2,t \right) = e^{-2\lambda t}\widehat{f_2}\left( n_1,n_2 \right) +2\lambda \int _{0}^t e^{-2\lambda \left( t-s \right) } \widehat{f_{1}}\left( n_1+n_2,s \right) ds\\&\quad = e^{-2\lambda t}\widehat{f_2}\left( n_1,n_2 \right) +2\lambda \left( \int _{0}^t e^{-2\lambda \left( t-s \right) }ds \right) \widehat{f_{1}}\left( n_1+n_2 \right) \\&\quad =e^{-2\lambda t}\widehat{f_2}\left( n_1,n_2 \right) + \left( 1-e^{-2\lambda t} \right) \widehat{f_1}\left( n_1+n_2 \right) , \end{aligned} \end{aligned}$$
(24)

from which we find that

$$\begin{aligned} \left|\widehat{f_2}\left( n_1,n_2,t \right) -\widehat{f_1}\left( n_1+n_2 \right) \right| \le 2 e^{-2\lambda t}=c_2e^{-2\lambda t}. \end{aligned}$$

We continue by induction: assume the claim holds for \(k-1\ge 2\) and consider k. Since

$$\begin{aligned} 2\lambda \sum _{i<j\le k}\int _0^t e^{-\lambda k\left( k-1 \right) \left( t-s \right) }ds =\lambda k\left( k-1 \right) \int _0^t e^{-\lambda k\left( k-1 \right) \left( t-s \right) }ds = 1- e^{-\lambda k\left( k-1 \right) t} \end{aligned}$$

we find that

$$\begin{aligned} \left|\widehat{f_k}\left( n_1,\dots ,n_k,t \right) - \widehat{f_1}\left( \sum _{j=1}^k n_j \right) \right|\le e^{-\lambda k\left( k-1 \right) t}\left|\widehat{f_1}\left( \sum _{j=1}^k n_j \right) \right| + \Bigg | e^{-\lambda k\left( k-1 \right) t}\widehat{f_k}\left( n_1,\dots ,n_k \right) \end{aligned}$$
$$\begin{aligned} +2\lambda \sum _{i<j\le k}\int _{0}^t e^{-\lambda k\left( k-1 \right) \left( t-s \right) } \left( \widehat{f_{k-1}}\left( n_1,\dots , n_i+n_j,\dots ,n_k,s \right) -\widehat{f_1}\left( \sum _{j=1}^k n_j \right) \right) ds\Bigg | \end{aligned}$$
$$\begin{aligned} \le 2 e^{-\lambda k\left( k-1 \right) t} + 2\lambda c_{k-1} \sum _{i<j\le k}\int _{0}^t e^{-\lambda k\left( k-1 \right) \left( t-s \right) } e^{-2\lambda s}ds \end{aligned}$$
$$\begin{aligned} =2 e^{-\lambda k\left( k-1 \right) t} + \lambda c_{k-1}k\left( k-1 \right) \frac{ e^{-2\lambda t}-e^{-\lambda k\left( k-1 \right) t} }{\lambda \left( k\left( k-1 \right) -2 \right) } \end{aligned}$$

where we have used (22) and the fact that \(2\sum _{i<j\le k}1 = k(k-1)\). Since \(k\ge 3\) we see that

$$\begin{aligned} \left|\widehat{f_k}\left( n_1,\dots ,n_k,t \right) - \widehat{f_1}\left( \sum _{j=1}^k n_j \right) \right|\le 2 e^{-\lambda k\left( k-1 \right) t} + c_{k-1}k\left( k-1 \right) \frac{ e^{-2\lambda t}}{k\left( k-1 \right) -2} \end{aligned}$$
$$\begin{aligned} \le \left( 2+\frac{c_{k-1}k\left( k-1 \right) }{k\left( k-1 \right) -2} \right) e^{-2\lambda t} = c_k e^{-2\lambda t}. \end{aligned}$$

We have thus shown the first statement of the theorem.

Next, we show the propagation of order by induction. Recall that according to Lemma 17 it will be enough for us to show that

$$\begin{aligned} \widehat{f_k}\left( n_1,\dots ,n_k,t \right) = \widehat{f_1}\left( \sum _{j=1}^k n_j \right) \end{aligned}$$

for any \(t>0\) and \(\left( n_1,\dots ,n_k \right) \in \mathbb {Z}^k\). Using Lemma 16 and the fact that \(\left\{ F_{N}(0)\right\} _{N\in \mathbb {N}}\) is \(f_1-\)ordered we conclude that

$$\begin{aligned} \widehat{f_2}\left( n_1,n_2 \right) =\lim _{N\rightarrow \infty }\widehat{F_{N,2}}\left( n_1,n_2,0 \right) =\widehat{f_1}\left( n_1+n_2 \right) . \end{aligned}$$

Using the fact that \(\widehat{f_1}(n,t)=\widehat{f_1}(n)\) for all \(t>0\) together with the above and (24) we find that

$$\begin{aligned} \begin{aligned}&\widehat{f_{2}}\left( n_1,n_2,t \right) =e^{-2\lambda t}\widehat{f_2}\left( n_1,n_2 \right) + \left( 1-e^{-2\lambda t} \right) \widehat{f_1}\left( n_1+n_2 \right) = \widehat{f_1}\left( n_1+n_2 \right) , \end{aligned} \end{aligned}$$

which shows our base induction step. We now assume that

$$\begin{aligned} \widehat{f_{k-1}}\left( n_1,\dots ,n_{k-1},t \right) = \widehat{f_1}\left( \sum _{l=1}^{k-1}n_l \right) \end{aligned}$$

for all \(t>0\) and \(\left( n_1,\dots ,n_{k-1} \right) \in \mathbb {Z}^{k-1}\), where \(k-1\ge 2\). As in the case \(k=2\) we know that the fact that \(\left\{ F_{N}(0)\right\} _{N\in \mathbb {N}}\) is \(f_1\)-ordered implies that

$$\begin{aligned} \widehat{f_k}\left( n_1,\dots ,n_k \right) =\lim _{N\rightarrow \infty }\widehat{F_{N,k}}\left( n_1,\dots ,n_k,0 \right) =\widehat{f_1}\left( \sum _{j=1}^k n_j \right) . \end{aligned}$$

Using our recursive formula (23) we find that for any \(t>0\)

$$\begin{aligned} \begin{aligned}&\widehat{f_{k}}\left( n_1,\dots ,n_k,t \right) \\&\quad = e^{-\lambda k\left( k-1 \right) t}\widehat{f_k}\left( n_1,\dots ,n_k \right) \\&\qquad +2\lambda \sum _{i<j\le k}\int _{0}^t e^{-\lambda k\left( k-1 \right) \left( t-s \right) } \widehat{f_{k-1}}\left( n_1,\dots , n_i+n_j,\dots ,n_k,s \right) ds\\&\quad =e^{-\lambda k\left( k-1 \right) t}\widehat{f_{1}}\left( \sum _{j=1}^k n_j \right) +2\lambda \sum _{i<j\le k}\int _{0}^t e^{-\lambda k\left( k-1 \right) \left( t-s \right) } \widehat{f_{1}}\left( \sum _{j=1}^k n_j \right) ds\\&\quad =e^{-\lambda k\left( k-1 \right) t}\widehat{f_{1}}\left( \sum _{j=1}^k n_j \right) +2\lambda \sum _{i<j\le k}\left( \int _{0}^t e^{-\lambda k\left( k-1 \right) \left( t-s \right) }ds \right) \widehat{f_{1}}\left( \sum _{j=1}^k n_j \right) \\&\quad =e^{-\lambda k\left( k-1 \right) t}\widehat{f_{1}}\left( \sum _{j=1}^k n_j \right) +\left( 1-e^{-\lambda k\left( k-1 \right) t} \right) \widehat{f_{1}}\left( \sum _{j=1}^k n_j \right) =\widehat{f_{1}}\left( \sum _{j=1}^k n_j \right) . \end{aligned} \end{aligned}$$

The proof, and with it this section, is now complete \(\square \)

4 The Case of Balanced Interactions

In this penultimate section, we consider the case where the interaction and time scaling are balanced. Surprisingly, Corollary 21 in Sect. 2 not only gives us the intuition to why Theorem 8 is true but also gives us the means to show that in the case where \(N\epsilon _N^2=1\) the solutions to the rescaled CL model can’t be ordered. The key idea in showing this is expressed in the following lemma:

Lemma 22

Consider a family of symmetric probability densities \(F_N\in \mathscr {P}\left( \mathcal {T}^N \right) \) with \(N\in \mathbb {N}\). If \(\left\{ F_N\right\} _{N\in \mathbb {N}}\) is \(f-\)ordered then

$$\begin{aligned} \lim _{N\rightarrow \infty }\widehat{F_{N,2}}\left( n,-n \right) = 1. \end{aligned}$$

Proof

Using Lemma 16 we see that if \(\left\{ F_N\right\} _{N\in \mathbb {N}}\) is \(f-\)ordered then

$$\begin{aligned} \lim _{N\rightarrow \infty }\widehat{F_{N,2}}\left( n,-n \right) = \widehat{f}\left( n+(-n) \right) =\widehat{f}(0)=1 \end{aligned}$$

as \(f\in \mathscr {P}\left( \mathcal {T} \right) \). \(\square \)

Proof of Theorem 11

The fact that \(\left\{ F_N(t)\right\} _{N\in \mathbb {N}}\) is not chaotic has been shown in the works of Carlen et al. [5, 6]. To show the lack of order we start by noticing that in this setting, Lemma 18 implies that for any \(n\in \mathbb {Z}\)

$$\begin{aligned} \lim _{N\rightarrow \infty } N\left( 1-\widehat{g_N}(n) \right) = \frac{m_2 n^2}{2}. \end{aligned}$$

Consequently, assuming that \(\left\{ F_{N,1}(0)\right\} _{N\in \mathbb {N}}\) and \(\left\{ F_{N,2}(0)\right\} _{N\in \mathbb {N}} \) converge weakly to \(f_1\) and \(f_2\) respectively and using (21), we find that for any \(t>0\),

$$\begin{aligned} \begin{aligned}&\lim _{N\rightarrow \infty }\widehat{F_{N,2}}\left( n_1,n_2,t \right) =e^{-\lambda \left( \frac{m_2}{2}\left( n_1^2+n_2^2 \right) +2 \right) t}\widehat{f_{2}}\left( n_1,n_2 \right) \\&\quad +\frac{4\left( e^{-\lambda \left( \frac{m_2}{2}\left( n_1^2+n_2^2 \right) +2 \right) t}- e^{-\frac{\lambda m_2}{2}\left( n_1+n_2 \right) ^2 t} \right) }{m_2\left( \left( n_1+n_2 \right) ^2-n_1^2-n_2^2 \right) -4}\widehat{f_{1}}\left( n_1+n_2 \right) \end{aligned} \end{aligned}$$

In particular, for any \(n\not =0\) and \(t>0\)

$$\begin{aligned} \lim _{N\rightarrow \infty }\widehat{F_{N,2}}\left( n,-n,t \right) =e^{-\lambda \left( m_2 n^2+2 \right) t}\widehat{f_{2}}\left( n,-n \right) - \frac{2\left( e^{-\lambda \left( m_2 n^2+2 \right) t}- 1 \right) }{m_2n^2+2} \end{aligned}$$
$$\begin{aligned} =e^{-\lambda \left( m_2 n^2+2 \right) t}\widehat{f_{2}}\left( n,-n \right) + \frac{2\left( 1-e^{-\lambda \left( m_2 n^2+2 \right) t} \right) }{m_2n^2+2} \end{aligned}$$
$$\begin{aligned} \le e^{-\lambda \left( m_2 n^2+2 \right) t} + \frac{2\left( 1-e^{-\lambda \left( m_2 n^2+2 \right) t} \right) }{m_2n^2+2} = \frac{m_2 n^2}{m_2n^2+2}e^{-\lambda \left( m_2 n^2+2 \right) t} + \frac{2}{m_2n^2+2} \end{aligned}$$
$$\begin{aligned}< \frac{m_2 n^2}{m_2n^2+2}e^{-2 \lambda t} + \frac{2}{m_2n^2+2} < 1. \end{aligned}$$

Due to Lemma 22 we conclude that \(\left\{ F_N(t)\right\} _{N\in \mathbb {N}}\) can’t be ordered for any \(t>0\), which completes the proof. \(\square \)

We conclude this short section with the proof of Theorem 12.

Proof of Theorem 12

Much like our previous proof, we start with the fact that in our setting

$$\begin{aligned} \lim _{N\rightarrow \infty } N\left( 1-\widehat{g_N}(n) \right) = \frac{m_2 n^2}{2}. \end{aligned}$$

Identity (23) together with the fact that \(\left\{ F_{N,1}(0)\right\} _{N\in \mathbb {N}}\) and \(\left\{ F_{N,2}(0)\right\} _{N\in \mathbb {N}} \) converge weakly to \(f_1\) and \(f_2\) respectively imply that

$$\begin{aligned} \lim _{N\rightarrow \infty }\widehat{F_{N,1}}\left( n_1,t \right) =\lim _{N\rightarrow \infty }e^{\lambda N \left( \widehat{g_{\epsilon _N}}\left( n \right) -1 \right) t}\widehat{F_{N,1}}\left( n,0 \right) =e^{-\frac{\lambda m_2}{2}n^2t}\widehat{f_1}(n), \end{aligned}$$

and

$$\begin{aligned} \begin{aligned}&\lim _{N\rightarrow \infty }\widehat{F_{N,2}}\left( n_1,n_2,t \right) =e^{-\lambda \left( \frac{m_2}{2}\left( n_1^2+n_2^2 \right) +2 \right) t}\widehat{f_{2}}\left( n_1,n_2 \right) \\&\quad + \frac{4\left( e^{-\lambda \left( \frac{m_2}{2}\left( n_1^2+n_2^2 \right) +2 \right) t}- e^{-\frac{\lambda m_2}{2}\left( n_1+n_2 \right) ^2 t} \right) }{m_2\left( \left( n_1+n_2 \right) ^2-n_1^2-n_2^2 \right) -4}\widehat{f_{1}}\left( n_1+n_2 \right) . \end{aligned} \end{aligned}$$

The convergence of the Fourier coefficients together with Lemma 15 imply the desired convergence to \(f_1(t)\) and \(f_2(t)\) given by the inverse transform of the above limits.Footnote 8 To show (13) and (14) we notice that

$$\begin{aligned} \lim _{t\rightarrow \infty }\widehat{f_1}\left( n_1,t \right) = {\left\{ \begin{array}{ll} 1,&{} n_1=0,\\ 0,&{} n_1\not =0, \end{array}\right. } = \mathcal {F}_{\mathcal {T}}\left( 1 \right) (n_1) \end{aligned}$$

and

$$\begin{aligned} \lim _{t\rightarrow \infty }\widehat{f_2}(n_1,n_2,t) = {\left\{ \begin{array}{ll} \frac{4}{m_2 \left( n_1^2+n_2^2 \right) +4},&{} n_1+n_2=0,\\ 0,&{} n_1+n_2\not =0, \end{array}\right. }= {\left\{ \begin{array}{ll} \frac{2}{m_2 n_1^2+2},&{} n_1+n_2=0,\\ 0,&{} n_1+n_2\not =0, \end{array}\right. }. \end{aligned}$$

The latter implies (14) since (with the help of the Dominated Convergence Theorem) we have that

$$\begin{aligned} \int _{\mathcal {T}^2} \left( \sum _{j\in \mathbb {Z}}\frac{2}{m_2 j^2+2}e^{ij \left( \theta _1-\theta _2 \right) } \right) e^{-in_1\theta _1-in_2\theta _2}\frac{d\theta _1d\theta _2}{\left( 2\pi \right) ^2} \end{aligned}$$
$$\begin{aligned} =\sum _{j\in \mathbb {Z}}\frac{2}{m_2 j^2+2}\int _{\mathcal {T}^2}e^{ij \left( \theta _1-\theta _2 \right) }e^{-in_1\theta _1-in_2\theta _2}\frac{d\theta _1d\theta _2}{\left( 2\pi \right) ^2}=\sum _{j\in \mathbb {Z}}\frac{2\delta _{j,n_1}\delta _{j,-n_2}}{m_2 j^2+2} \end{aligned}$$

where \(\delta _{i,j}\) is the Kronecker delta, and consequently

$$\begin{aligned} \mathcal {F}_{\mathcal {T}^2}\left( \mathcal {H}\left( \cdot _1-\cdot _2 \right) \right) \left( n_1,n_2 \right) = {\left\{ \begin{array}{ll} \frac{2}{m_2 n_1^2+2},&{} n_1+n_2=0,\\ 0,&{} n_1+n_2\not =0. \end{array}\right. } \end{aligned}$$

\(\square \)

5 Final Remarks

5.1 On the Notion of Order

Our definition of order (Definition 5) was motivated by our expectation to see total adherence in the CL and other models—a “perpendicular” behaviour to chaoticity. One might argue that a more appropriate name would be “perfect order” or “perfect alignment” to take into account that some partial order/alignment can also manifest (as might be indicated by Theorem 12). However, to keep our introduction of this new asymptotic notion more coherent we elected to use the simpler term.

We would like to emphasise that the main idea behind the notion of order is that for any \(k\in \mathbb {N}\) the limit process retains only one degree of randomness (vs. chaoticity which has k degrees of randomness). This means that this notion can be adapted to other situations where we don’t necessarily expect that all the variables equal in the limit, but where one “average element” completely determines the limiting behaviour of any finite group of elements (for instance, a one dimensional chain of elements whose variables are always a fixed distance from each other).

5.2 On the Generation of Order

As was mentioned in Remark 10, Theorem 8 guarantees the generation of order, though this statement is not as strong as we would hope. In particular, in order to see order appearing we need to consider the limiting marginals (i.e. take N to infinity) and then take time to infinity. It would be interesting to see if we can find an explicit function t(N), that goes to infinity when N goes to infinity, such that \(F_{N,k}\left( t\left( N \right) \right) \) converges to an ordered state as N goes to infinity. We suspect that to achieve this one might need a stronger notion of convergence than weak convergence of measures which is also quantitative.

5.3 Between Order and Chaos

The balanced setting, discussed in Theorems 11 and 12, poses an interesting “in between” case between our order and suspected chaos. While no order is observed in this case, Theorem 12 suggests that there is still a chance we will see some partial adherence, at least in the second marginal, with deviations given by a fixed function. This motivates us to consider a potential notion of partial order, where the delta functionals in (6) are replaced by some functions that measure how close the variables may get. In other words \(\Pi _1\left( d\mu _N \right) \) converges to a profile f and \(\Pi _k\left( d\mu _N \right) \) converges to something of the form

$$\begin{aligned} \frac{1}{k!}\sum _{\sigma \in S_k}f\left( \theta _{\sigma (1)} \right) \prod _{i=1}^{k-1} h\left( \theta _{\sigma (i)}-\theta _{\sigma (i+1)} \right) , \end{aligned}$$

for some \(h\in \mathscr {P}\left( \mathcal {T} \right) \) and where \(S_k\) is the group of permutation of order k. It is unclear at this point if the above is suitable to capture the behaviour of even the simple CL model in the balanced scaling, but the investigation of such a notion is, in our opinion, an exciting prospect which we will pursue.

5.4 Additional Models

The CL model did not only motivate the definition of the new notion of order—it was also an ideal model to test it. One notable issue with this model, however, is its simplicity. In particular, its BBGKY hierarchy is closed—something that doesn’t happen in most many element models. It would be interesting to try and test the notion of order in other mean field models that should exhibit strong adherence. Prime candidates are swarming models such as the Bertin, Droz and Grégoire model, which was introduced in [3] and is mentioned in the works of Carlen et al. [5, 6], and societal models such as the opinion models presented in the review paper of Chaintron and Diez [7]. Following on ideas presented in the original works on the CL model as well as in this paper, one would expect that the first step to deal with any mean field model which may exhibit a phenomena of order would be to find the appropriate scaling. This might not be as easy a feat as it is in the CL model and additional technical difficulties are expected due to the coupled BBGKY hierarchy.