Phase Transitions and Macroscopic Limits in a BGK Model of Body-Attitude Coordination

Degond, P.; Diez, A.; Frouvelle, A.; Merino-Aceituno, S.

doi:10.1007/s00332-020-09632-x

Phase Transitions and Macroscopic Limits in a BGK Model of Body-Attitude Coordination

Open access
Published: 30 May 2020

Volume 30, pages 2671–2736, (2020)
Cite this article

Download PDF

You have full access to this open access article

Journal of Nonlinear Science Aims and scope Submit manuscript

Phase Transitions and Macroscopic Limits in a BGK Model of Body-Attitude Coordination

Download PDF

P. Degond ORCID: orcid.org/0000-0002-4886-6968¹,
A. Diez¹,
A. Frouvelle² &
…
S. Merino-Aceituno^3,4

1513 Accesses
11 Citations
1 Altmetric
Explore all metrics

Abstract

In this article we investigate the phase transition phenomena that occur in a model of self-organisation through body-attitude coordination. Here, the body attitude of an agent is modelled by a rotation matrix in ${\mathbb {R}}^3$ as in Degond et al. (Math Models Methods Appl Sci 27(6):1005–1049, 2017). The starting point of this study is a BGK equation modelling the evolution of the distribution function of the system at a kinetic level. The main novelty of this work is to show that in the spatially homogeneous case, self-organisation may appear or not depending on the local density of agents involved. We first exhibit a connection between body-orientation models and models of nematic alignment of polymers in higher-dimensional space from which we deduce the complete description of the possible equilibria. Then, thanks to a gradient-flow structure specific to this BGK model, we are able to prove the stability and the convergence towards the equilibria in the different regimes. We then derive the macroscopic models associated with the stable equilibria in the spirit of Degond et al. (Arch Ration Mech Anal 216(1):63–115, 2015, Math Models Methods Appl Sci 27(6):1005–1049, 2017).

Body-Attitude Alignment: First Order Phase Transition, Link with Rodlike Polymers Through Quaternions, and Stability

MaxEP and Stable Configurations in Fluid–Solid Interactions

A de Broglie–Bohm Model of Pure Shape Dynamics: N-Body system

Article Open access 18 June 2024

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

The model studied in the present work is a new elaboration of the work initiated in Degond et al. (2017) to model collective behaviour of agents described by their position and body attitude. New results about emergence of phenomena of body-attitude coordination are presented in the context of a Bhatnagar–Gross–Krook (BGK) model. Such models can be applied to many biological systems such as flocking birds (Hildenbrandt et al. 2010), fish school (Hemelrijk et al. 2010; Hemelrijk and Hildenbrandt 2012) or sperm motion (Degond and Navoret 2015). These systems are constituted by a large number of self-propelled agents which move at a constant speed and try to imitate their neighbours by moving in the same direction and trying to coordinate their body attitude. The agents are modelled by a moving frame in dimension 3, i.e. three orthogonal axes, one of which gives the direction of the motion and the two others the body orientation. In this work, as in Degond et al. (2017), the body attitude is modelled by a rotation matrix in dimension 3, i.e. an element of the special orthogonal group $SO_3({\mathbb {R}})$. In Degond et al. (2018b) agents are modelled by quaternions.

Collective behaviour in many-agent systems has been a thoroughly studied subject in the mathematical literature, from both theoretical and applied points of view. Among the models which have received the most attention, one can cite the Cucker–Smale model (Cucker and Smale 2007; Ha and Liu 2009; Motsch and Tadmor 2011), attractive-repulsive models (Carrillo et al. 2017) or the Vicsek model for self-propelled particles (Vicsek et al. 1995). A recent review can be found in Albi et al. (2019). The present work belongs to the class of Vicsek-inspired models. Such models have two main distinctive features, first the assumption that the particles are self-propelled and secondly a geometrical constraint: in the original work of Vicsek, the velocities of the particles have constant norm and the dynamics therefore takes place on the sphere ${\mathbb {S}}^{n-1}$ in dimension n ($n=2$ in Vicsek et al. 1995). Here the dynamics takes place on the Riemannian manifold $SO_3({\mathbb {R}})$. In the statistical physics literature, such models belong to the field of (soft-)active matter physics and are widely used to model biological phenomena, from the microscopic motility of cells to animal flocking. A review can be found in Marchetti et al. (2013).

The tools used to study models of collective behaviour are generally borrowed from the mathematical kinetic theory of gases which gives a mathematical framework to study many-particle systems. At a microscopic scale, the motion of each particle is detailed (individual-based model, IBM) through ordinary differential equations (ODEs) coming from Newton’s laws or through stochastic processes. When the number of particles is large, the whole system is described at a mesoscopic scale by a kinetic partial differential equation such as the Boltzmann, Fokker–Planck or BGK equation. Finally, large-scale dynamics is described by macroscopic equations (Euler, Navier–Stokes...). A review of the main results of kinetic theory of gases can be found in Degond (2004). In particular, the BGK equation (for Bhatnagar–Gross–Krook) was introduced in Bhatnagar et al. (1954) as a substitute for the Boltzmann equation in the context of gas dynamics. The BGK operator is a relaxation operator towards a Maxwellian having the same moments as the distribution function of the system. Its mathematical properties and relevance in the mathematical kinetic theory of gases have been studied in particular in Perthame (1989) and Saint-Raymond (2003). The BGK operator has been used in a model of collective dynamics of self-propelled particles in Dimarco and Motsch (2016). However, together with Degond et al. (2018a), it is the first time that it is rigorously studied in a body-attitude coordination model.

The main mathematical challenge in classical kinetic theory is the rigorous derivation of the kinetic equations from the IBM and of the macroscopic models from the kinetic equations. These questions are at the core of Hilbert’s sixth problem and have received much attention in the last decades. Many different techniques have been developed to derive kinetic equations from hard-sphere gases (Boltzmann–Grad limit Lanford 1975; Gallagher et al. 2014), from systems of interacting particles (mean-field limit and propagation of chaos Jabin 2014; Hauray and Jabin 2007; Sznitman 1991) or from stochastic processes (and in particular jump processes Mischler and Mouhot 2013; Kac 1956). Some of these techniques have been adapted to problems arising in the study of collective behaviour (Chuang et al. 2007; Bolley et al. 2011, 2012). The passage from kinetic equations to macroscopic models generally depends on physical constraints and in particular on conservation laws (hydrodynamic limits, Hilbert and Chapman–Enskog methods, see Cercignani et al. (2013), Degond (2004) for a review) and is still an active research field (Guo and Jang 2010; Esposito et al. 2018; Caflisch 1980; Golse and Saint-Raymond 2004). In the context of self-propelled particles, due to the lack of conservation laws which normally hold in the classical kinetic theory of gases, specific tools are needed. In Degond and Motsch (2008), a methodological breakthrough has been achieved by introducing the so-called generalised collisional invariants (GCI) to rigorously link kinetic and macroscopic equations in the context of collective behaviour of self-propelled particles. This technique is now rigorously justified (Jiang et al. 2016) and has already been successfully applied to a wide range of problems (Jiang et al. 2017; Zhang and Jiang 2017). It will be the key here to derive the macroscopic model in Sect. 6. This will lead to a system of partial differential equations on the mean density and body attitude, referred as the self-organised hydrodynamics for body-attitude coordination (SOHB) in Degond et al. (2017).

The aim of this work is to show the emergence of collective behaviour and self-organisation which give rise to macroscopic scale patterns such as clusters and travelling bands. These patterns emerge from the collective interactions and are not directly encoded in the behaviour of the individual particles as described by the IBM. The continuum version of the Vicsek model (Degond and Motsch 2008) named the self-organised hydrodynamics (SOH) model is an exemple of a model able to describe such emergence of self-organised dynamics. The Vicsek model describes a system where agents try to imitate their neighbours by adapting their direction of motion to the average direction of their neighbours. It has been shown that, in a certain scaling and when the equilibrium of the system is reached, the directions of motion of the agents are not uniformly distributed but follow a von Mises distribution. For $\kappa \in {\mathbb {R}}_+$ and $\Omega \in {\mathbb {S}}^{n-1}$ the von Mises distribution of parameters $\kappa $ and $\Omega $ is the probability density function (PDF) on ${\mathbb {S}}^{n-1}$ defined by:

$$\begin{aligned} M_{\kappa \Omega }(\omega ):=\frac{e^{\kappa \Omega \cdot \omega }}{\int _{{\mathbb {S}}^{n-1}}e^{\kappa \Omega \cdot \omega '}\,\mathrm{d}\omega '}, \end{aligned}$$

where the dot product is the usual dot product in ${\mathbb {R}}^n$. This model (Degond and Motsch 2008) has been the starting point of many other models of self-organised dynamics, including Degond et al. (2017) for the body-attitude coordination. In this context, we define the von Mises distribution of parameter $J\in {\mathscr {M}}_3({\mathbb {R}})$ (a $3\times 3$ real matrix) as the following PDF on $SO_3({\mathbb {R}})$:

$$\begin{aligned} M_J(A):=\frac{e^{J\cdot A}}{\int _{SO_3({\mathbb {R}})} e^{J\cdot A'}\,\mathrm{d}A'}, \end{aligned}$$

where the dot product and the measure on $SO_3({\mathbb {R}})$ come from the Riemannian structure of $SO_3({\mathbb {R}})$ detailed in Sect. 3.

In the present work, we focus on phase transition phenomena between non-organised and organised dynamics (collective motion). We will prove that the spatial density of agents is the key parameter which encodes the main features of phase transitions: in low density regions, no self-organised dynamics appears but when the density crosses a critical value, self-organised dynamics, given by a von Mises distribution for the body-attitude, becomes a stable equilibria of the system. This phase transition in the dynamics is purely an emergent phenomena, in the sense that at the macroscopic scale, different equations are required to describe the dynamics for different values of the density of agents, whereas for the IBM and at a mesoscopic level, the dynamics is described by one unique (system of) equation(s).

The starting point of this study is the BGK equation

$$\begin{aligned} \partial _t f+(Ae_1\cdot \nabla _x)f=\rho _f M_{J_f}-f, \end{aligned}$$

where f(t, x, A) is a probability measure which gives the distribution of agents at position $x\in {\mathbb {R}}^3$ with body orientation $A\in SO_3({\mathbb {R}})$ at time $t\in {\mathbb {R}}_+$ and where:

$$\begin{aligned} \rho _f(t,x)=\int _{SO_3({\mathbb {R}})} f(t,x,A)\,\mathrm{d}A\,\,\,\,\,\,\text {and}\,\,\,\,\,J_f=\int _{SO_3({\mathbb {R}})} f(t,x,A)A\,\mathrm{d}A \end{aligned}$$

are the respective local density and flux. The measure on $SO_3({\mathbb {R}})$ is the normalised Haar measure, the main properties of which are summarised in Sect. 3.

The left-hand side of the equation models the transport phenomenon: an agent with body orientation $A\in SO_3({\mathbb {R}})$ moves in the direction $Ae_1$ where $e_1$ is the first vector of the canonical basis of ${\mathbb {R}}^3$. The right-hand side of the equation is the BGK operator which models the interactions between the agents: here we assume that f relaxes towards a “moving equilibrium” which takes the form of a von Mises distribution. In particular, the von Mises distribution appears as the analog of the Maxwellian distribution of the classical gas dynamics. The flux $J_f$ plays the same role as the momentum density for gas dynamics or the average flux for the Vicsek model. The term $\rho _f M_{J_f}$ can therefore be seen as the analog of the “Maxwellian distribution with same moments as f” in the context of the BGK equation for gas dynamics.

The main results of this work are (informally) summarised in the two following theorems.

Theorem 1

Let us consider the spatially homogeneous BGK equation:

$$\begin{aligned} \partial _t f=\rho M_{J_f}-f, \end{aligned}$$

where $\rho \in {\mathbb {R}}_+$ is a given density of agents.

1.
The equilibria $f^{\mathrm {eq}}$ of the spatially homogeneous BGK model are either the uniform equilibrium $f^{\mathrm {eq}}=\rho $ or of the form $f^{\mathrm {eq}}=\rho M_{\alpha \Lambda }$ or $f^{\mathrm {eq}}=\rho M_{\alpha \,p\otimes q}$ where $\Lambda \in SO_3({\mathbb {R}})$ and $p,q\in {\mathbb {S}}^2$ and where $\alpha \in {\mathbb {R}}$ and $\rho $ are linked by a compatibility equation to be defined later [see Sect. 4 and Eqs. (19) and (20)].
2.
Depending on the density of agents $\rho \in {\mathbb {R}}_+$, the only stable equilibria are either the uniform equilibrium $f^{\mathrm {eq}}=\rho $ or the equilibria of the form $f^{\mathrm {eq}}=\rho M_{\alpha \Lambda }$ where $\Lambda ~\in ~SO_3({\mathbb {R}})$ and where $\alpha \in {\mathbb {R}}_+$ is linked to $\rho $ by a compatibility equation to be defined later.

The first point of this theorem is detailed in Sect. 4 (see in particular Theorem 5 and Corollary 4.2). The second point is detailed in Sect. 5 (see in particular Theorem 7). We will then prove the following result.

Theorem 2

(Formal) Let us consider the rescaled spatially inhomogeneous problem

$$\begin{aligned} \partial _t f^\varepsilon +(Ae_1\cdot \nabla _x)f^\varepsilon =\frac{1}{\varepsilon }\Big (\rho _{f^\varepsilon } M_{J_{f^\varepsilon }}-f^\varepsilon \Big ), \end{aligned}$$

where

$$\begin{aligned} \rho _f(t,x) = \int _{SO_3({\mathbb {R}})} f(t,x,A)\,\mathrm{d}A\,\,\,\,\,\text {and}\,\,\,\,\,J_f(t,x)=\int _{SO_3({\mathbb {R}})} f(t,x,A)A\,\mathrm{d}A. \end{aligned}$$

1.
We assume that in a disordered region, $f^\varepsilon $ converges as $\varepsilon \rightarrow 0$ towards a density $\rho =\rho (t,x)$ uniform in the body-attitude variable. Then, the density $\rho ^\varepsilon \equiv \rho _{f^\varepsilon }$ satisfies at first order the following diffusion equation:
$$\begin{aligned} \partial _t\rho ^\varepsilon =\varepsilon \nabla _x\cdot \left( \frac{\frac{1}{3}\nabla _x\rho ^\varepsilon }{1-\frac{\rho ^\varepsilon }{\rho _c}}\right) ,\,\,\,\,\,\,\rho _c=6. \end{aligned}$$
2.
We assume that in an ordered region, $f^\varepsilon $ converges as $\varepsilon \rightarrow 0$ towards an equilibrium of the form $\rho M_{\alpha \Lambda }$ with $\rho \in {\mathbb {R}}_+$, $\alpha \in {\mathbb {R}}_+$ and $\Lambda \in SO_3({\mathbb {R}})$ defined above. Then, the density $\rho =\rho (t,x)$ and mean body attitude $\Lambda =\Lambda (t,x)\in SO_3({\mathbb {R}})$ satisfy the SOHB model given by the following system of partial differential equations:
$$\begin{aligned}&\partial _t \rho +\nabla _x\cdot (\rho c_1(\alpha (\rho ))\Lambda e_1)=0, \\&\rho (\partial _t\Lambda +{\tilde{c}}_2((\Lambda e_1)\cdot \nabla _x)\Lambda )+\tilde{c_3}[(\Lambda e_1)\times \nabla _x\rho ]_\times \Lambda \\&\quad +c_4\rho [-{\mathbf {r}}_x(\Lambda )\times (\Lambda e_1)+\delta _x(\Lambda )\Lambda e_1]_\times \Lambda =0. \end{aligned}$$
where $\alpha =\rho c_1(\alpha )$ and ${\tilde{c}}_2$, $\tilde{c_3}$, $c_4$ are functions of $\rho $ to be defined later and $\delta $ and ${\mathbf {r}}$ are the “divergence” and “rotational” operators defined in Degond et al. (2017) (see Sect. 6)

This theorem is detailed in Sect. 6 (see in particular Proposition 6.1 and Theorem 9).

The phase transition problem has been completely treated in the space-homogeneous case for the Vicsek model in Degond et al. (2015), but the geometrical structure inherent to body-orientation models requires specific tools and techniques. In particular, the rotation group $SO_3({\mathbb {R}})$ is a compact Lie group, endowed with a Haar measure. The links between this topological structure and the Riemannian structure (detailed in Sect. 3 and Appendix B) will be the key to reduce the problem to a form that shares structural properties with the models of nematic alignment of polymers, studied in a completely different context to model liquid crystals (Han et al. 2015; Wang and Hoffman 2008; Zhou and Wang 2011; Ball and Majumdar 2010; Ball 2017). These two worlds will be formally linked through the isomorphism between $SO_3({\mathbb {R}})$ and the group of unit quaternions detailed in Sect. 4.2 and Appendix A. It will lead to the first point of Theorem 1 (the complete description of the equilibria, Sect. 4). As in Wang and Hoffman (2008), we will see that there exist a class of equilibria which cannot be interpreted as equilibria around a mean-body orientation. These equilibria were not studied in Degond et al. (2017), Degond et al. (2018b). A key point of the proof will be the reduction to a problem for diagonal matrices which will be a consequence of the left and right invariance of the Haar measure together with an adapted version of the singular value decomposition of a matrix (Definition 3.2).

The stability of the different equilibria is studied in Sect. 5.2. We will show that our model has an underlying gradient-flow structure which will allow us to determine the asymptotic behaviour of the system after a reduction to an ODE in ${\mathbb {R}}^3$. This is a specificity of the BGK model which doesn’t hold for the other models of body-attitude coordination (Degond et al. 2017, 2018b) and allows us to use different and simpler techniques. In particular, we will prove that the equilibria which cannot be interpreted as equilibria around a mean body orientation are always unstable, which tends to justify the analysis carried out in Degond et al. (2017) for a model where only equilibria around a mean body-orientation were considered.

Finally, the SOHB model (Sect. 6) will be obtained as in Degond et al. (2017) by using the GCI. However, compared to Degond et al. (2017), additional terms appear which require a specific treatment and in particular the coefficient $\tilde{c_3}$ that appears in Theorem 2 is different from the one that appears in Degond et al. (2017). The SOHB model (43) raises many questions, most of which are still open, and its mathematical and numerical analyses are still in progress. In particular, the hyperbolicity of the model is currently under study (Degond et al. 2020) and has been shown when $\tilde{c_3}$ is constant.

The organisation of the work is the following: in Sect. 2 we will give a review of the existing models at a microscopic and mesoscopic scales and motivate the study of the BGK equation among them. In Sect. 3, we gather the main technical results we will constantly use throughout this work. In Sect. 4, we will describe, depending on the density, all the possible equilibria of the system. We will use the tools developed to mathematically study the alignment of polymers (Wang and Hoffman 2008; Zhou and Wang 2011). In Sect. 5 we will describe the asymptotic behaviour of the system and in particular which equilibria are attained, leading to a self-organised dynamics or not. This will be based on a specific underlying gradient-flow structure of the BGK equation. Finally in Sect. 6 we will write the macroscopic models for the stable equilibria.

Notations. For the convenience of the reader, we collect here the main notations we will use in the following.

${\mathscr {M}}_n({\mathbb {R}})$ is the set of $n\times n$ real matrices.
${\mathscr {D}}_n({\mathbb {R}})\subset {\mathscr {M}}_n({\mathbb {R}})$ is the subspace of $n\times n$ diagonal real matrices.
${{\,\mathrm{Tr}\,}}(M)$ denotes the trace of the matrix $M\in {\mathscr {M}}_n({\mathbb {R}})$ and $M^\mathrm{T}$ its transpose.
$I_n$ denotes the identity matrix in dimension n.
${{\,\mathrm{diag}\,}}:{\mathbb {R}}^n\rightarrow {\mathscr {D}}_n({\mathbb {R}})$ is the vector space isomorphism such that for $(d_1,\dots ,d_n)\in {\mathbb {R}}^n$, $D={{\,\mathrm{diag}\,}}(d_1,\dots ,d_n)$ is the diagonal matrix, the (i, i)th coefficient of which is equal to $d_i$ for $i\in \{1,\ldots ,n\}$.
${\mathscr {S}}_n({\mathbb {R}})$ and ${\mathscr {A}}_n({\mathbb {R}})$ denote, respectively, the sets of symmetric and skew-symmetric matrices of dimension n.
$SO_n({\mathbb {R}})$ is the special orthogonal group in dimension n, i.e. the group of matrices $P\in {\mathscr {M}}_n({\mathbb {R}})$ such that $PP^\mathrm{T}=I_n$ and $\det P>0$.
${\mathbb {S}}^n\subset {\mathbb {R}}^{n+1}$ is the sphere of dimension n.
${\mathbb {H}}$ is the group of unitary quaternions.
$\langle \cdot \rangle _{g}$ denotes the mean for the probability density g on $SO_n({\mathbb {R}})$. We will simply write $\langle \cdot \rangle $ when g is the uniform probability ($g\equiv 1$).
A will generically be a rotation matrix in $SO_3({\mathbb {R}})$ and $a_{ij}$ its (i, j) coefficient.
PD(M) is the orthogonal part of the polar decomposition of $M\in {\mathscr {M}}_n({\mathbb {R}})$ when $\det M\ne 0$: there exists a unique couple $(PD(M),S)\in {\mathcal {O}}_n({\mathbb {R}})\times {\mathscr {S}}_n({\mathbb {R}})$ such that $M=PD(M)S$. The matrix PD(M) is given by $PD(M)=M\left( \sqrt{M^\mathrm{T}M}\right) ^{-1}$.
For a matrix $M\in {\mathscr {M}}_3({\mathbb {R}})$, the orbit ${{\,\mathrm{Orb}\,}}(M)\subset {\mathscr {M}}_3({\mathbb {R}})$ is defined by:
$$\begin{aligned} {{\,\mathrm{Orb}\,}}(M):=\{PMQ,\,\,\,P,Q\in SO_3({\mathbb {R}})\}. \end{aligned}$$
(1)
${\mathbb {R}}_+:=[0,+\infty )$, ${\mathbb {R}}_+^*:=(0,+\infty )$

2 The BGK Equation and Other Related Models of Self-organisation

In this section we give a review of the different existing models of collective dynamics at both microscopic and mesoscopic levels and emphasise the singularity of the BGK model among them.

2.1 A Review of the Different IBM

The rigorous proofs of the two following theorems (Theorems 3 and 4) can be found in Diez (2019) in a more general framework.

At a microscopic level, we fix a reference frame given by the canonical basis $(e_1,e_2,e_3)$ of ${\mathbb {R}}^3$. The agents are described by their position $X\in {\mathbb {R}}^3$ and their body-attitude $A\in SO_3({\mathbb {R}})$ which can be seen as a moving frame. We assume that an agent with body attitude $A\in SO_3({\mathbb {R}})$ moves at a constant speed in the direction of the first vector of A: the instantaneous velocity of the agent is $A e_1$.

In the following we consider an increasing sequence of jump times $(T_n)_n$ such that the increments between two jumps are independent and follow an exponential law of parameter $N\in {\mathbb {N}}^*$ (their expectation is 1/N). The N agents are described at time $t\in {\mathbb {R}}_+$ by their positions and body attitudes $Z^{N}_t=\big \{(X^{i,N}_t,A^{i,N}_t)\big \}_{i\in \{1,\ldots ,N\}}\in \big ({\mathbb {R}}^3\times SO_3({\mathbb {R}})\big )^N$. The interactions between the agents can be modelled by the following piecewise deterministic Markov process (PDMP) which has already been described heuristically in Degond et al. (2018a), Dimarco and Motsch (2016):

1.
Between two jump times $({T}_n,{T}_{n+1})$, the systems evolves in a deterministic way:
$$\begin{aligned} \forall i\in \{1,\ldots ,N\},\,\,\,\,\,\,\,\mathrm{d}X_t^{i,N}=(A^{i,N}_te_1)\mathrm{d}t,\,\,\,\,\,\,\,\mathrm{d}A^{i,N}_t=0. \end{aligned}$$
2.
At time ${T}_{n+1}$, a particle $i\in \{1,\ldots ,N\}$ is chosen uniformly among the N particles. At time ${T}_{n+1}^+$, the new body-orientation of particle i is sampled from the PDF $M_{J^i\big (Z_{{T}_{n+1}^-}^N\big )}$ where for $Z^N\in ({\mathbb {R}}^3\times SO_3({\mathbb {R}}))^N$ we define the flux:
$$\begin{aligned} J^{i}(Z^N):=\frac{1}{N}\sum _{j=1}^{N} K\Big (|X^{i,N}-X^{j,N}|\Big )A^{j,N}\in {\mathscr {M}}_3({\mathbb {R}}), \end{aligned}$$
and where K is a smooth observation kernel.

The following theorem describes the limiting behaviour of the laws of the particles when $N\rightarrow +\infty $. Under the assumption that the empirical measure of the N processes converges (weakly) towards a smooth function f, we can formally derive the equation satisfied by f as in (Degond et al. 2018a, Section 4.2). The rigorous formulation of this so-called propagation of chaos property has been investigated in Diez (2019).

Theorem 3

Let $f_0$ be a probability measure on the space ${\mathbb {R}}^3\times SO_3({\mathbb {R}})$, and let $Z_0^N\in ({\mathbb {R}}^3\times SO_3({\mathbb {R}}))^N$ be an initial state given by N independent random variables, identically distributed with law $f_0$. Then, for any $t\in {\mathbb {R}}_+$, the law $f^N_t$ of any of one of the processes $(X^{i,N}_t,A^{i,N}_t)_t$ at time t converges weakly towards the solution $f_t$ of the following BGK equation with initial condition $f_0$:

$$\begin{aligned} \partial _t f+(Ae_1\cdot \nabla _x)f=\rho _f M_{J_{K*f}}-f \end{aligned}$$

where $J_{K*f}$ is a matrix-valued function of the space variable $x\in {\mathbb {R}}^3$ defined by

$$\begin{aligned} J_{K*f}(x):=\iint _{{\mathbb {R}}^3\times SO_3({\mathbb {R}})} K(x-y)Af(y,A)dy \mathrm{d}A\in {\mathscr {M}}_3({\mathbb {R}}). \end{aligned}$$

In the following we will focus on a purely local version of the BGK equation obtained in the limit $K\rightarrow \delta _0$. This limit can be made rigorous starting from the IBM and by using an appropriate rescaling of the size of the kernel (called moderate interaction) with respect to the number of particles (see Oelschläger 1985; Jourdain and Méléard 1998 for classical results and Diez (2019) for additional details regarding the present model).

In the previous works, the IBM was typically given as in Degond et al. (2017) by a system of stochastic differential equations such as the following:

$$\begin{aligned}&{\mathrm{d}X_k}=A_k(t)e_1\,\mathrm{d}t, \end{aligned}$$

(2a)

$$\begin{aligned}&{\mathrm{d}A_k}= P_{T_{A_k}}\circ \left( \Big (\frac{1}{N}\sum _{i=1}^{N} K(|X_i-X_k|) A_i\Big )\mathrm{d}t+2\sqrt{D}dB_t\right) . \end{aligned}$$

(2b)

where $P_{T_{A_k}}$ denotes the projection on the tangent space of $SO_3({\mathbb {R}})$ at $A_k\in SO_3({\mathbb {R}})$ (see Sect. 3). In this case, the resulting equation when $N\rightarrow +\infty $ is a nonlinear Fokker–Planck equation (see Bolley et al. 2012 for a rigorous proof in the Vicsek case).

In the spatially homogeneous case, we can take the observation kernel K to be constantly equal to 1 to prove the mean-field limit. The agents are described at time $t\in {\mathbb {R}}_+$ only by their body-attitudes $\big \{A^{i,N}\big \}_{i\in \{1,\ldots ,N\}}\in SO_3({\mathbb {R}})^N$ and they follow the following jump process: at each jump time $T_n$, compute the flux

$$\begin{aligned} J^N_t=\frac{1}{N}\sum _{i=1}^N A_t^{i,N}, \end{aligned}$$

choose a particle $i\in \{1,\ldots N\}$ uniformly among the N particles and draw the new body-orientation $A^{i,N}_{T_n^+}$ after the jump according to the law given by the PDF $M_{J^N_{T_n^-}}$. The following theorem describes analogously the limiting behaviour of the laws of the particles as $N\rightarrow +\infty $.

Theorem 4

Let $\big \{A^{i,N}_0\big \}_{i\in \{1,\ldots ,N\}}\in SO_3({\mathbb {R}})^N$ be an initial state given by N independent random variables, identically distributed according to a law $f_0$ on $SO_3({\mathbb {R}})$. Then for any $t\in {\mathbb {R}}_+$, the law $f^N_t$ of any of one of the processes $(A^{i,N}_t)_t$ at time t converges weakly towards the solution $f_t$ of the following spatially homogeneous BGK equation with initial condition $f_0$:

$$\begin{aligned} \partial _t f=M_{J_{f}}-f. \end{aligned}$$

2.2 A Review of the Different Kinetic Equations

The model studied in the present article belongs to a class of models, the study of which has been initiated in Degond and Motsch (2008) as a continuum version of the Vicsek model (Vicsek et al. (1995)). These models can be classified in two types. First, in the Vicsek-type models, the agents are described by their orientation defined as a unit vector in ${\mathbb {S}}^{n-1}$. In the second type of models, we take into account their body orientation, defined as a rotation matrix in $SO_3({\mathbb {R}})$. Our study enters into this second framework.

The kinetic version of the Vicsek-type or body orientation-type models is given either by a Fokker–Planck equation or by a BGK equation. In this work, we will focus on the BGK equation

$$\begin{aligned} \partial _t f+(Ae_1\cdot \nabla _x)f=\rho _f M_{J_f}-f, \end{aligned}$$

(3)

where

$$\begin{aligned} \rho _f(t,x)=\int _{SO_3({\mathbb {R}})} f(t,x,A)\,\mathrm{d}A\,\,\,\,\,\,\text {and}\,\,\,\,\,J_f=\int _{SO_3({\mathbb {R}})} f(t,x,A)A\,\mathrm{d}A. \end{aligned}$$

The Fokker–Planck version of our model corresponds to:

$$\begin{aligned} \partial _t f +(Ae_1\cdot \nabla _x)f=\nabla _A\cdot \left[ M_{J_f}\nabla _A\left( \frac{f}{M_{J_f}}\right) \right] , \end{aligned}$$

(4)

where $\nabla _A$ and $\nabla _A\cdot $ are, respectively, the gradient and the divergence in $SO_3({\mathbb {R}})$ for the Riemannian structure detailed in Sect. 3. Apart from the fact that the underlying interaction process (Dimarco and Motsch 2016; Degond et al. 2018a) which leads to the BGK model is different from the one that leads to the Fokker–Planck model, the BGK model is structurally different and can be treated independently by using specific and simpler mathematical techniques presented in the next sections. Nevertheless, the BGK and Fokker–Planck models share important properties. For instance, the following functional is a free-energy for both the spatially homogeneous BGK equation and the spatially homogeneous Fokker–Planck equation (though with a different dissipation term):

$$\begin{aligned} {\mathcal {F}}[f]:=\int _{SO_3({\mathbb {R}})} f\log f-\frac{1}{2}|J_f|^2. \end{aligned}$$

(5)

It satisfies in both cases:

$$\begin{aligned} \frac{\mathrm{d}}{\mathrm{d}t}{\mathcal {F}}[f]=-{\mathcal {D}}[f]\le 0, \end{aligned}$$

where ${\mathcal {D}}[f]$ is the dissipation term which is equal for the BGK model to:

$$\begin{aligned} {\mathcal {D}}[f]=\int _{SO_3({\mathbb {R}})} (f-\rho M_{J_f})(\log f-\log (\rho M_{J_f}))\ge 0. \end{aligned}$$

In the context of the Vicsek model, this free energy was the key to study the phase transition phenomena (Degond et al. 2015) and we believe that the same kind of study can be made in the body-attitude coordination dynamics modelled by a Fokker–Planck equation (4). Moreover, in the Fokker–Planck case this dissipation inequality implies a gradient flow structure in the Wasserstein-2 distance which has been studied (in the Vicsek case) in Figalli et al. (2018). However, the BGK model has another underlying gradient-flow dynamics (studied in Sect. 5) on which the present study will be based, and we will therefore not use this free-energy in the present work.

Both models (BGK and Fokker–Planck) have a normalised and a non-normalised version. The model (3) will be referred as the non-normalised BGK model. A normalised model is a model where the flux $J_f$ is replaced by the orthogonal part of its polar decomposition $\Lambda _f:=PD(J_f)$ as defined in the introduction and under the assumption that $\det J_f>0$. The normalised Fokker–Planck model is the model studied in Degond et al. (2017):

$$\begin{aligned} \partial _t f +(Ae_1\cdot \nabla _x)f=\nabla _A\cdot \left[ M_{\Lambda _f}\nabla _A\left( \frac{f}{M_{\Lambda _f}}\right) \right] . \end{aligned}$$

This terminology comes from the continuum version of the Vicsek model (Degond and Motsch 2008) where either the total flux

$$\begin{aligned} J_f=\int _{{\mathbb {S}}^{n-1}}f(t,x,v)v\,dv, \end{aligned}$$

or its normalisation

$$\begin{aligned} \Omega _f=\frac{\int _{{\mathbb {S}}^{n-1}} f(t,x,v)v\,dv}{\left| \int _{{\mathbb {S}}^{n-1}} f(t,x,v)v\,dv\right| }\in {\mathbb {S}}^{n-1}, \end{aligned}$$

is considered. A mathematical analysis of the normalised Vicsek model can be found in Figalli et al. (2018), Gamba and Kang (2016). The importance of this distinction in the context of phase transitions has been shown in Degond et al. (2015) and Degond et al. (2013): phase transitions appear only in non-normalised models.

The following chart (Fig. 1) shows the different models and gives references where they are studied (when such references exist).

Finally, in Sects. 4 and 5, we will focus on the spatially homogeneous version of the BGK model (3) given by:

$$\begin{aligned} \partial _t f = \rho {M_{J_f}}-f, \end{aligned}$$

(6)

where the probability distribution f(t, A) only depends on the body-orientation variable and time. In the spatially homogeneous case, the local density of agents previously denoted by $\rho _f$ does not depend on f in the sense that an initial density $\rho _{f_0}\in {\mathbb {R}}_+$ associated with the initial distribution $f_0$ is preserved by the dynamics:

$$\begin{aligned} \forall t\in {\mathbb {R}}_+,\,\,\,\,\, \rho _f(t)=\rho _{f_0}, \end{aligned}$$

as it can be seen by integrating the equation over $SO_3({\mathbb {R}})$. We therefore take $\rho \in {\mathbb {R}}_+$ as a fixed parameter of the problem. Note also that the well-posedness of (6) directly follows from Duhamel’s formula:

$$\begin{aligned} f(t)=e^{-t}f_0+\rho \int _0^t e^{-(t-s)}M_{J_{f(s)}}\,\mathrm{d}s, \end{aligned}$$

since $J_f$ is given as the solution of the following differential equation on ${\mathscr {M}}_3({\mathbb {R}})$:

$$\begin{aligned} \frac{\mathrm{d}}{\mathrm{d}t}J_f=\rho \langle A\rangle _{M_{J_f}}-J_f,\,\,\,\,\,\,J_f(t=0)=J_{f_0}, \end{aligned}$$

as it can be seen by multiplying (6) by A and integrating over $SO_3({\mathbb {R}})$. Note that it contrasts with the Fokker–Planck case where even the well-posedness of the spatially-homogeneous equation would require further investigations. This will be part of future work.

3 Preliminaries: Structure and Calculus in $SO_n({\mathbb {R}})$

This paragraph collects the main properties of the Riemannian manifold $SO_n({\mathbb {R}})$ and other technical results. In this paragraph, $n\ge 3$ denotes the dimension, and we will mainly consider the case $n=3$ in the next sections.

3.1 Structure and Haar Measure on $SO_n({\mathbb {R}})$

Lemma 3.1

The following is an inner product on ${\mathscr {M}}_n({\mathbb {R}})$:

$$\begin{aligned} A\cdot B:=\frac{1}{2}{{\,\mathrm{Tr}\,}}(A^\mathrm{T} B),\end{aligned}$$

(7)

and the following properties hold:

Endowed with this metric, $SO_n({\mathbb {R}})$ is a topological group and a Riemannian manifold.
The sets ${\mathscr {S}}_n({\mathbb {R}})$ and ${\mathscr {A}}_n({\mathbb {R}})$ of symmetric and skew-symmetric matrices are orthogonal and ${\mathscr {M}}_n({\mathbb {R}})={\mathscr {S}}_n({\mathbb {R}})\oplus {\mathscr {A}}_n({\mathbb {R}})$.
For $A\in SO_n({\mathbb {R}})$, the tangent space to $SO_n({\mathbb {R}})$ at A is denoted by $T_A$ and
$$\begin{aligned} M\in T_A\,\,\,\,\,\text {if and only if there exists}\,\,\,P\in {\mathscr {A}}_n({\mathbb {R}})\,\,\,\text {such that}\,\,\,M=AP. \end{aligned}$$

The norm on ${\mathscr {M}}_n({\mathbb {R}})$ associated with the inner product (7) will be denoted by $\Vert \cdot \Vert $.

Remark 3.1

The unusual scaling factor 1/2 in the definition of the inner product (7) is chosen so that the induced Riemannian distance between a rotation matrix and the identity matrix is exactly equal to the angle of the rotation. Note that without this scaling factor, the matrix-norm induced by (7) is the classical Frobenius norm.

The general theory of locally compact topological groups ensures the existence of a left-invariant Haar measure $\mu $ on $SO_n({\mathbb {R}})$. Since $SO_n({\mathbb {R}})$ is compact thus unimodular, the Haar measure is both left- and right-invariant, which means that it satisfies for all $P\in SO_n({\mathbb {R}})$ and all Borel set ${\mathcal {E}}$ of the Borel $\sigma $-algebra of $SO_n({\mathbb {R}})$:

$$\begin{aligned} \mu (P{\mathcal {E}})=\mu ({\mathcal {E}}P)=\mu ({\mathcal {E}}), \end{aligned}$$

where $P{\mathcal {E}}=\{PA,\,\,A\in {\mathcal {E}}\}$ and ${\mathcal {E}}P=\{AP,\,\,A\in {\mathcal {E}}\}$. We will assume that $\mu $ is the unique Haar measure which is a probability measure and simply write

$$\begin{aligned} \int _{SO_n({\mathbb {R}})} f(A)\,\mathrm{d}\mu (A)\equiv \int _{SO_n({\mathbb {R}})} f(A)\,\mathrm{d}A. \end{aligned}$$

As a consequence if $P\in SO_n({\mathbb {R}})$, $A\mapsto PA$ and $A\mapsto AP$ are two changes of variable with unit Jacobian. We will constantly use the following changes of variable:

Definition 3.1

(Useful changes of variable). Let us define the following matrices:

For $i\ne j\in \{1,\ldots ,n\}$, $D^{ij}\in SO_n({\mathbb {R}})$ is the diagonal matrix such that all its coefficients are equal to 1 except at positions i and j where they are equal to $-1$.
For $i\ne j\in \{1,\ldots ,n\}$, $P^{ij}\in SO_n({\mathbb {R}})$ is the matrix such that $P^{ij}_{ii}=P^{ij}_{jj}=0$, $P^{ij}_{kk}=1$ for $k\ne i,j$, $P^{ij}_{ij}=1$ and $P^{ij}_{ji}=-1$. The other coefficients are equal to 0.

Then, we define the following changes of variable with unit Jacobian:

$A'=D^{ij}A$ multiplies the rows i and j by $-1$. Everything else remains unchanged.
$A'=AD^{ij}$ multiplies the columns i and j by $-1$. Everything else remains unchanged.
$A'=D^{ij} AD^{ij}$ multiplies the elements (k, i), (k, j) and (i, k), (j, k) by $-1$ for $k\ne i,j$. Everything else remains unchanged
$A'=P^{ij}A$ multiplies row i by $-1$ and permutes the rows i and j.
$A'=P^{ij} A(P^{ij})^\mathrm{T}$ exchanges the diagonal coefficients (i, i) and (j, j) (and involves other changes).

The two following lemmas are important applications of these results.

Lemma 3.2

Let $D\in {\mathscr {M}}_n({\mathbb {R}})$ be a diagonal matrix and $M_D$ the von Mises distribution with parameter D, then

$$\begin{aligned} \langle A\rangle _{M_D}:=\int _{SO_3({\mathbb {R}})}\,AM_D(A)\,\mathrm{d}A \end{aligned}$$

is diagonal.

Proof

Let $k\ne \ell $ and $m\ne k,\ell $. The change of variable $A\mapsto D^{km}AD^{km}$ gives:

$$\begin{aligned} \int _{SO_n({\mathbb {R}})} a_{k,\ell }\,e^{D\cdot A}\,\mathrm{d}A=-\int _{SO_3({\mathbb {R}})} a_{k,\ell }\,e^{D\cdot A}\,\mathrm{d}A=0, \end{aligned}$$

where we have used that $D^{k,m}DD^{k,m}=D$. $\square $

Lemma 3.3

For any $n\ge 3$ and any $J\in {\mathscr {M}}_n({\mathbb {R}})$,

$$\begin{aligned} \int _{SO_n({\mathbb {R}})} (J\cdot A)A\,\mathrm{d}A= \frac{1}{2n} J. \end{aligned}$$

Lemma 3.4

Let $n\ge 3$, $n\ne 4$. Let $g:SO_n({\mathbb {R}})\rightarrow {\mathbb {R}}$ such that for all $A,P\in SO_n({\mathbb {R}})$, $g(A)=g(A^\mathrm{T})~=~g(PAP^\mathrm{T})$. For all $J\in {\mathscr {M}}_n({\mathbb {R}})$ we have:

$$\begin{aligned} \int _{SO_n({\mathbb {R}})} (J\cdot A)A\,g(A)\,\mathrm{d}A= a{{\,\mathrm{Tr}\,}}(J) I_n+b J+cJ^\mathrm{T}, \end{aligned}$$

for given $a, b, c\in {\mathbb {R}}$ depending on g and on the dimension, the expressions of which can be found in the proof.

The proof of these lemmas and other technical results about $SO_3({\mathbb {R}})$ and $SO_n({\mathbb {R}})$ are postponed to Appendix B.

3.2 Volume Forms in $SO_3\pmb {{\mathbb {(R)}}}$

When an explicit calculation will be needed, we will use one of the two following parametrisations of $SO_3({\mathbb {R}})$ which give two explicit expressions of the normalised Haar measure in dimension 3.

To a matrix $A\in SO_3({\mathbb {R}})$, there is an associated angle $\theta \in [0,\pi ]$ and a vector ${\mathbf {n}}\in {\mathbb {S}}^2$ such that A is the rotation of angle $\theta $ around the axis ${\mathbf {n}}$. Rodrigues’ formula gives a representation of A knowing $\theta $ and ${\mathbf {n}}=(n_1,n_2,n_3)$:
$$\begin{aligned} A=A(\theta ,{\mathbf {n}})=I_3+ \sin \theta [{\mathbf {n}}]_\times +(1-\cos \theta )[{\mathbf {n}}]_\times ^2=\exp (\theta [{\mathbf {n}}]_\times ), \end{aligned}$$
(8)
where
$$\begin{aligned}{}[{\mathbf {n}}]_\times :=\left( \begin{array}{ccc} 0 &{} -n_3 &{} n_2 \\ n_3 &{}0 &{} -n_1 \\ -n_2 &{} n_1 &{} 0 \end{array}\right) , \end{aligned}$$
and we have:
$$\begin{aligned} {[}{\mathbf {n}}]_\times ^2 = {\mathbf {n}}\otimes {\mathbf {n}}-I_3. \end{aligned}$$
If $f(A(\theta ,{\mathbf {n}}))={\bar{f}}(\theta ,{\mathbf {n}})$, the volume form of $SO_3({\mathbb {R}})$ is given by:
$$\begin{aligned} \int _{SO_3({\mathbb {R}})} f(A)\,\mathrm{d}A= \frac{2}{\pi }\int _0^\pi \sin ^2(\theta /2)\int _{{\mathbb {S}}^2} {\bar{f}}(\theta ,{\mathbf {n}})\,d{\mathbf {n}}\,\mathrm{d}\theta . \end{aligned}$$
With the usual parametrisation of the sphere ${\mathbb {S}}^2$, we can take ${\mathbf {n}}=(n_1,n_2,n_3)^\mathrm{T}$ with
$$\begin{aligned} \left\{ \begin{array}{rcl} n_1&{}=&{}\sin \psi \cos \varphi , \\ n_2&{}=&{}\sin \psi \sin \varphi , \\ n_3&{}=&{} \cos \psi , \end{array}\right. \end{aligned}$$
where $\psi \in [0,\pi ]$ and $\varphi \in [0,2\pi ]$. The volume form for the sphere is given by:
$$\begin{aligned} d{\mathbf {n}}=\frac{1}{4\pi }\sin \psi \mathrm{d}\psi \mathrm{d}\varphi . \end{aligned}$$
We have the following one-to-one map:
$$\begin{aligned} \Psi {:}\,\left| \begin{array}{rcl} SO_{2}({\mathbb {R}})\times {\mathbb {S}}^{2}&{}\longrightarrow &{} SO_3({\mathbb {R}})\\ (A,p)&{}\longmapsto &{} M(p)A^a \end{array}\right. \end{aligned}$$
(9)
where
$$\begin{aligned} A^a:=\left( \begin{array}{cc}A &{} 0 \\ 0 &{} 1 \end{array}\right) \in SO_3({\mathbb {R}}), \end{aligned}$$
and for $p=(\sin \phi _{1}\,\sin \phi _{2},\cos \phi _{1}\,\sin \phi _{2},\cos {\phi _2})^\mathrm{T}$ in spherical coordinates $\phi _{1}\in [0,2\pi ]$ and $\phi _2\in [0,\pi ]$, we define:
$$\begin{aligned} M(p):=\left( \begin{array}{ccc} \cos \phi _{1} &{} \sin \phi _{1}\,\cos \phi _{2} &{} \sin \phi _{1}\,\sin \phi _{2} \\ -\sin \phi _{1} &{} \cos \phi _{1}\,\cos \phi _{2} &{} \cos \phi _{1}\,\sin \phi _{2} \\ 0 &{} -\sin \phi _{2} &{} \cos {\phi _2} \end{array}\right) \in SO_3({\mathbb {R}}). \end{aligned}$$
The matrix $A^a$ performs an arbitrary rotation of the first 2 coordinates, and the matrix $M(p)\in SO_3({\mathbb {R}})$ maps the vector $e_3$ to $p\in {\mathbb {S}}^{2}$. A matrix $A\in SO_3({\mathbb {R}})$ can thus be written as the product:
$$\begin{aligned} \left( \begin{array}{ccc} \cos \phi _{1} &{} \sin \phi _{1}\,\cos \phi _{2} &{} \sin \phi _{1}\,\sin \phi _{2} \\ -\sin \phi _{1} &{} \cos \phi _{1}\,\cos \phi _{2} &{} \cos \phi _{1}\,\sin \phi _{2} \\ 0 &{} -\sin \phi _{2} &{} \cos {\phi _2} \end{array}\right) \left( \begin{array}{ccc} \cos \theta &{} \sin \theta &{} 0 \\ -\sin \theta &{} \cos \theta &{} 0 \\ 0 &{} 0 &{} 1 \end{array}\right) \end{aligned}$$
where $\phi _{1},\theta \in [0,2\pi ]$ and $\phi _2\in [0,\pi ]$. With this parametrisation:
$$\begin{aligned} \int _{SO_3({\mathbb {R}})}f(A)\mathrm{d}A= \frac{1}{2\pi }\int _{0}^{2\pi } \int _{{\mathbb {S}}^2} f\left( M(p)\left( \begin{array}{ccc} \cos \theta &{} \sin \theta &{} 0 \\ -\sin \theta &{} \cos \theta &{} 0 \\ 0 &{} 0 &{} 1 \end{array}\right) \right) \,\mathrm{d}\theta \,dp, \end{aligned}$$
(10)
and the volume form on the sphere is given by:
$$\begin{aligned} dp=\frac{1}{4\pi }\sin \phi _2\,\mathrm{d}\phi _1\,\mathrm{d}\phi _2. \end{aligned}$$

This parametrisation can be extended in any dimension and comes from the Lie groups quotient:

$$\begin{aligned} \frac{SO_n({\mathbb {R}})}{SO_{n-1}({\mathbb {R}})}\cong {\mathbb {S}}^{n-1}. \end{aligned}$$

3.3 Singular Value Decomposition (SVD)

We recall the following classical result proved in (Quarteroni et al. 2010, Section 1.9).

Proposition 3.1

(Singular Value Decomposition, SVD) Any square matrix $M\in {\mathscr {M}}_n({\mathbb {R}})$ can be written:

$$\begin{aligned} M=PDQ \end{aligned}$$

where $P,Q\in {\mathcal {O}}_n({\mathbb {R}})$ and D diagonal with non-negative coefficients listed in decreasing order.

In order to use the properties of the Haar measure, we will need the matrices P and Q to belong to $SO_3({\mathbb {R}})$ (not only ${\mathcal {O}}_3({\mathbb {R}}))$ and we define therefore another decomposition, called the special singular value decomposition (SSVD) in the following.

Definition 3.2

(SSVD in $SO_3({\mathbb {R}})$). Let $M\in {\mathscr {M}}_3({\mathbb {R}})$. A special singular value decomposition (SSVD) of M is a decomposition of the form

$$\begin{aligned} M=PDQ \end{aligned}$$

where $P,Q\in SO_3({\mathbb {R}})$ and $D={{\,\mathrm{diag}\,}}(d_1,d_2,d_3)$ with

$$\begin{aligned} d_1\ge d_2\ge |d_3|. \end{aligned}$$

The existence of a SSVD follows from Proposition 3.1. Let us start from a SVD

$$\begin{aligned} M=P'D'Q'. \end{aligned}$$

If $\det M>0$, either $P',Q'\in SO_3({\mathbb {R}})$ and the SVD is a SSVD or $P',Q'$ have both negative determinant and in this case we can take
$$\begin{aligned} P=P'{\tilde{D}},\,\,\,\,Q={\tilde{D}}Q'\,\,\,\,\text {and}\,\,\,\,D=D' \end{aligned}$$
where ${\tilde{D}}={{\,\mathrm{diag}\,}}(1,1,-1)$.
If $\det M<0$, either $P'\in SO_3({\mathbb {R}})$ or $Q'\in SO_3({\mathbb {R}})$ (only one of them). Assume without loss of generality that $Q'\in SO_3({\mathbb {R}})$. Then, we can take:
$$\begin{aligned} P=P'{\tilde{D}},\,\,\,\, D={\tilde{D}}D\,\,\,\,\text {and}\,\,\,\,Q=Q'. \end{aligned}$$
If $\det M=0$, then the last coefficient of $D'$ is equal to 0 so ${\tilde{D}}D'=D'$ and $D'{\tilde{D}}=D'$. We can take $D=D'$. If $P'\notin SO_3({\mathbb {R}})$ we can take $P=P'{\tilde{D}}$ and if $Q'\notin SO_3({\mathbb {R}})$ we can take $Q={\tilde{D}}Q'$.

Remark 3.2

As for the polar decomposition and the standard SVD, the matrix D is always unique. However, the matrices P and Q may not be unique.

The subset ${\mathscr {D}}\subset {\mathscr {M}}_3({\mathbb {R}})$ of the diagonal matrices which are the diagonal part of a SSVD is the cone delimited by the image by the isomorphism ${{\,\mathrm{diag}\,}}$ of the three planes $\{d_1=d_2\}$, $\{d_2=d_3\}$ and $\{d_2=-d_3\}$ in ${\mathbb {R}}^3$ and depicted in Fig. 3:

$$\begin{aligned} D={{\,\mathrm{diag}\,}}(d_1,d_2,d_3)\in {\mathscr {D}}\,\,\,\text {if and only if}\,\,\,d_1\ge d_2\ge |d_3|. \end{aligned}$$

(11)

4 Equilibria of the BGK Operator

In this section we determine the equilibria for the BGK operator:

$$\begin{aligned} Q_\mathrm{BGK}(f):=\rho M_{J_f}-f, \end{aligned}$$

(12)

that is to say the distributions f such that $Q_\mathrm{BGK}(f)=0$. In Sect. 4.1 we characterise these equilibria (Theorem 5) and show that for them to exist, compatibility equations must be fulfilled. These compatibility equations depend on the density $\rho $. Therefore, for different values of the density $\rho $, there exists different equilibria. These will be determined in Sect. 4.2 by studying the compatibility equations. A full description of the equilibria of the BGK operator is finally given in Corollary 4.2.

4.1 Characterisation of the Equilibria and Compatibility Equations

The main result of this section is Theorem 5 which gives all the equilibria of the BGK operator (12). Before stating and proving it we will need the following lemma which is the analog of Lemma 4.4 in Degond et al. (2017). The proof of this lemma is an application of the results presented in Sect. 3.

Lemma 4.1

(Consistency relations) The following holds:

(i)
There exists a function $c_1=c_1(\alpha )$ defined for all $\alpha \in {\mathbb {R}}$ such that for all $\Lambda \in SO_3({\mathbb {R}})$,
$$\begin{aligned} c_1(\alpha )\Lambda =\langle A\rangle _{M_{\alpha \Lambda }}. \end{aligned}$$
(13)
The function $c_1$ can be explicitly written $c_1(\alpha )=\frac{1}{3}\big \{ (2\cos \theta +1)\big \}_{\alpha }$ where $\{\cdot \}_\alpha $ denotes the mean with respect to the probability density
$$\begin{aligned} \theta \in [0,\pi ]\longmapsto \frac{\sin ^2(\theta /2) e^{\alpha \cos \theta }}{\int _0^\pi \sin ^2(\theta '/2) e^{\alpha \cos \theta '}\,\mathrm{d}\theta '}. \end{aligned}$$
(14)
(ii)
Consider the set ${\mathscr {B}}\subset {\mathscr {M}}_3({\mathbb {R}})$ defined by:
$$\begin{aligned} {\mathscr {B}}:=\left\{ B=P\left( \begin{array}{ccc}1 &{} &{} \\ &{} 0 &{} \\ &{} &{} 0 \end{array}\right) Q,\,\,\,\,\,P,Q\in SO_3({\mathbb {R}})\right\} =\{p\otimes q,\,\,\,p,q\in {\mathbb {S}}^2\}. \end{aligned}$$
There exists a function $c_2=c_2(\alpha )$ defined for all $\alpha \in {\mathbb {R}}$ such that for all $B\in {\mathscr {B}}$,
$$\begin{aligned} c_2(\alpha )B=\langle A\rangle _{M_{\alpha B}}. \end{aligned}$$
(15)
The function $c_2$ can be explicitly written: $c_2(\alpha )=[ \cos \phi ]_\alpha $, where $[\cdot ]_\alpha $ denotes the mean with respect to the probability density
$$\begin{aligned} \varphi \in [0,\pi ]\,\longmapsto \frac{\sin \varphi \,e^{\frac{\alpha }{2}\cos \varphi }}{\int _0^\pi \sin \varphi '\,e^{\frac{\alpha }{2}\cos \varphi '}\,\mathrm{d}\varphi '}. \end{aligned}$$
(16)

Remark 4.1

The relevance of the set ${\mathscr {B}}$ will become apparent in Proposition 4.2.

Proof

(i)
Using the left invariance of the Haar measure, it is enough to prove the result for $\Lambda = I_3$, since
$$\begin{aligned} \langle A\rangle _{M_{\alpha \Lambda }}=\frac{\int _{SO_3({\mathbb {R}})} A e^{\alpha A\cdot \Lambda }\,\mathrm{d}A}{\int _{SO_3({\mathbb {R}})} e^{\alpha \Lambda \cdot A}\,\mathrm{d}A}=\Lambda \frac{\int _{SO_3({\mathbb {R}})} \Lambda ^\mathrm{T} A e^{\alpha \Lambda ^\mathrm{T}A\cdot I_3}\,\mathrm{d}A}{\int _{SO_3({\mathbb {R}})} e^{\alpha \Lambda ^\mathrm{T}A\cdot I_3}\,\mathrm{d}A}=\Lambda \langle A\rangle _{M_{\alpha I_3}}. \end{aligned}$$
When $\Lambda =I_3$, Lemma 3.2 first ensures that $\langle A\rangle _{M_{\alpha I_3}}$ is diagonal, then the change of variable $A'=P^{12}A(P^{12})^\mathrm{T}$ (see Definition 3.1) shows that:
$$\begin{aligned} \langle a_{11}\rangle _{M_{\alpha I_3}}=\langle a_{22}\rangle _{M_{\alpha I_3}}. \end{aligned}$$
Proceeding analogously with the other coefficients, we have that $\langle A\rangle _{M_{\alpha I_3}}$ is proportional to $I_3$, i.e. there exists $c_1=c_1(\alpha )\in {\mathbb {R}}$ such that
$$\begin{aligned} c_1(\alpha )I_3=\langle A\rangle _{\alpha I_3}. \end{aligned}$$
(17)
The parametrisation of $SO_3({\mathbb {R}})$ using Rodrigues’ formula (8) then gives the explicit expression of $c_1$ by taking the trace in Eq. (17) and using that for $A=A(\theta ,{\mathbf {n}})$, ${{\,\mathrm{Tr}\,}}(A)=2\cos \theta +1$.
(ii)
As before, using the left and right invariance of the Haar measure it is enough to prove the result for $B={{\,\mathrm{diag}\,}}(1,0,0)$. Now if $D={{\,\mathrm{diag}\,}}(a,b,-b)$ for $a,b\in {\mathbb {R}}$, then the change of variable $A\mapsto P^{23}A(P^{23})^\mathrm{T}$ followed by the change of variable $A\mapsto D^{23}A$ (see Definition 3.1) shows that
$$\begin{aligned} \int _{SO_3({\mathbb {R}})} a_{22} e^{D\cdot A}\,\mathrm{d}A=-\int _{SO_3({\mathbb {R}})} a_{33} e^{D\cdot A}\,\mathrm{d}A, \end{aligned}$$
which proves with Lemma 3.2 that $\langle A\rangle _{M_D}$ is diagonal of the form ${{\,\mathrm{diag}\,}}({\tilde{a}},{\tilde{b}},-{\tilde{b}})$ for ${\tilde{a}},{\tilde{b}}\in {\mathbb {R}}$. Similarly, if $D={{\,\mathrm{diag}\,}}(a,b,b)$, then $\langle A\rangle _{M_D}$ is of the form ${{\,\mathrm{diag}\,}}({\tilde{a}},{\tilde{b}},{\tilde{b}})$. These two results prove that $\langle A\rangle _{M_{\alpha B}}$ is proportional to B, i.e. there exists $c_2=c_2(\alpha )\in {\mathbb {R}}$ such that (15) holds. The parametrisation of $SO_3({\mathbb {R}})$ coming from the isomorphism (9) then gives the explicit expression of $c_2$ by taking $B={{\,\mathrm{diag}\,}}(1,0,0)$ in Eq. (15). First, using the change of variable $A\mapsto P^{13}A(P^{13})^\mathrm{T}$ it holds that
$$\begin{aligned} c_2(\alpha )=\frac{1}{Z}\int _{SO_3({\mathbb {R}})}a_{11}e^{\frac{\alpha }{2}a_{11}}\,\mathrm{d}A=\frac{1}{Z}\int _{SO_3({\mathbb {R}})}a_{33}e^{\frac{\alpha }{2}a_{33}}\,\mathrm{d}A \end{aligned}$$
where
$$\begin{aligned} Z=\int _{SO_3({\mathbb {R}})}e^{\frac{\alpha }{2}a_{11}}\,\mathrm{d}A=\int _{SO_3({\mathbb {R}})}e^{\frac{\alpha }{2}a_{33}}\,\mathrm{d}A. \end{aligned}$$
Then, using the parametrisation (10), it follows that:
$$\begin{aligned} c_2(\alpha )=\frac{\int _{0}^\pi \cos \varphi \sin \varphi e^{\frac{\alpha }{2}\cos \varphi }\,\mathrm{d}\varphi }{\int _{0}^\pi \sin \varphi e^{\frac{\alpha }{2}\cos \varphi }\,\mathrm{d}\varphi }. \end{aligned}$$

$\square $

Remark 4.2

We could alternatively use one of the two parametrisations of $SO_3({\mathbb {R}})$ given in Sect. 3.2 or the quaternion formulation to prove that $\langle A\rangle _{\alpha I_3}$ and $\langle A\rangle _{\alpha B}$ are proportional to $I_3$ and B. However, the proof that we have just presented here holds in any dimension (the value of the constants $c_1(\alpha )$ and $c_2(\alpha )$ depends on the dimension but not the form of the matrices), whereas the volume forms and the quaternion formulation strongly depend on the dimension $n=3$.

We can now state the main result of this section:

Theorem 5

(Equilibria for the homogeneous Body-Orientation BGK equation) Let $\rho ~\in ~{\mathbb {R}}_+$ be a given density. The equilibria of the spatially homogeneous BGK equation (6) are the distributions of the form $f=\rho M_J$ where $J\in {\mathscr {M}}_3({\mathbb {R}})$ is a solution of the matrix compatibility equation:

$$\begin{aligned} J=\rho \langle A\rangle _{M_J}. \end{aligned}$$

(18)

The solutions of the compatibility equation (18) are:

1.
the matrix $J=0$,
2.
the matrices of the form $J=\alpha \Lambda $ with $\Lambda \in SO_3({\mathbb {R}})$ and where $\alpha \in {\mathbb {R}}$ satisfies the scalar compatibility equation
$$\begin{aligned} \alpha =\rho c_1(\alpha ), \end{aligned}$$
(19)
3.
the matrices of the form $J=\alpha B$ where $B\in {\mathscr {B}}$ and where $\alpha \in {\mathbb {R}}$ satisfies the scalar compatibility equation
$$\begin{aligned} \alpha =\rho c_2(\alpha ), \end{aligned}$$
(20)

where the set ${\mathscr {B}}$ and the functions $c_1$ and $c_2$ are defined in Lemma 4.1.

Remark 4.3

Notice that the existence of a nonzero solution for the scalar compatibility equations (19) and (20) is not guaranteed for all values of $\rho >0$. The existence of nonzero solutions for these equations will be explored in Sect. 4.2. They will determine the existence of equilibria for Eq. (6) for a given value of $\rho $ (Corollary 4.2).

Remark 4.4

The fact that these matrices are solutions of the matrix compatibility equation (18) follows directly from the consistency relations (13) and (15) as it will be shown in the proof of Theorem 5. The main difficulty of the proof is therefore the necessary condition: we will prove that a solution of the matrix compatibility equation (18) is necessarily of one of the forms listed in Theorem 5.

Remark 4.5

(Physical interpretation of the equilibria). The first two types of equilibria can be interpreted as statistical descriptions of, respectively, a disordered state (case $J=0$) or an ordered (or flocking) state where a physical average body-orientation $\Lambda \in SO_3({\mathbb {R}})$ can be identified and where $\alpha $ plays the role of a concentration parameter around the “mean” value $\Lambda $. Note that $\alpha $ depends on $\rho $. In the following, it will be shown that the flocking equilibrium is stable when the density $\rho $ is sufficiently large. Moreover, the concentration parameter $\alpha $ will be larger for larger values of the density (after a certain threshold of the density). The third type of equilibria is specific to the body-orientation model and can be physically understood as the statistical description of a system composed of two groups of agents moving in the same direction but where one group is oriented upside-down with respect to the other. The diagonal element ${{\,\mathrm{diag}\,}}(1,0,0)\in {\mathscr {B}}$ can indeed be obtained as the arithmetic average of the body-orientations ${{\,\mathrm{diag}\,}}(1,1,1)\in SO_3({\mathbb {R}})$ and ${{\,\mathrm{diag}\,}}(1,-1,-1)\in SO_3({\mathbb {R}})$. It corresponds to the case of two agents moving in the $e_1$ direction and where the body orientation of an agent can be obtained from the other by a rotation of angle $\pi $ around the $e_1$ axis. As it can be expected, equilibria of this type will be shown to be always unstable.

The proof of this theorem will use the two following propositions. The first one and its corollary (Proposition 4.1 and Corollary 4.1) show that the compatibility equation (18) can be reduced to a compatibility equation on diagonal matrices [equation (21)]. The second one (Proposition 4.2) provides a necessary condition for a diagonal matrix to be a solution of (21). The proof of Proposition 4.2 is deferred to the next section.

Proposition 4.1

(Orbital reduction) The following equivalence holds: $J\in {\mathscr {M}}_3({\mathbb {R}})$ is a solution of the matrix compatibility equation (18) if and only if for all $J'\in {{\,\mathrm{Orb}\,}}(J)$, $J'$ is a solution of the matrix compatibility equation (18).

Proof

This is a consequence of the left and right invariance of the Haar measure which ensures that for any $J\in {\mathscr {M}}_3({\mathbb {R}})$ and any $P,Q\in SO_3({\mathbb {R}})$:

$$\begin{aligned} \langle PAQ\rangle _{M_J}=\langle A\rangle _{M_{PJQ}.} \end{aligned}$$

$\square $

Since the diagonal part of the SSVD of a matrix J is in the orbit of J, we obtain the following corollary:

Corollary 4.1

(Reduction to diagonal matrices) Let $J\in {\mathscr {M}}_3({\mathbb {R}})$ with SSVD given by $J~=~PDQ$. The following equivalence holds: J is a solution of (18) if and only if D is a solution of (18).

We will therefore consider only the following problem in dimension 3: find all the diagonal matrices $D\in {\mathscr {M}}_3({\mathbb {R}})$ such that

$$\begin{aligned} \left\{ \begin{array}{l} D=\rho \langle A\rangle _{M_D}\\ D\in {\mathscr {D}}, \end{array}\right. \end{aligned}$$

(21)

where the set ${\mathscr {D}}$ is the subset of diagonal matrices which are the diagonal part of a SSVD and is defined by (11). Notice that Eq. (21) is just Eq. (18) restricted to the set ${\mathscr {D}}$.

Remark 4.6

The diagonal part $D\in {\mathscr {M}}_3({\mathbb {R}})$ of a SSVD of a matrix $J\in {\mathscr {M}}_3({\mathbb {R}})$ is unique so the problems (18) and (21) are equivalent. Notice that there might be other diagonal matrices in ${{\,\mathrm{Orb}\,}}(J)$ [take, for example, J diagonal which does not satisfy the conditions (11)]. However, the diagonal part of any SSVD of these matrices is D: the diagonal part of the SSVD characterises the orbit of a matrix. In the following, we will find all the diagonal solutions of (18) (i.e. the solutions of (21) without the restriction $D\in {\mathscr {D}}$) and then only consider the ones which belong to ${\mathscr {D}}$. For instance, we will see that there are solutions of (18) of the form ${{\,\mathrm{diag}\,}}(0,-\alpha ,0)$ where $\alpha >0$. The diagonal part of their SSVD is ${{\,\mathrm{diag}\,}}(\alpha ,0,0)$ and is a solution of (21).

Remark 4.7

A diagonal solution D of the matrix compatibility equation (18) verifies that $D/\rho $ belongs to the set:

$$\begin{aligned} \Omega =\Big \{D={{\,\mathrm{diag}\,}}(d_1,d_2,d_3),\,\,\,\exists \,\,f\in {\mathcal {P}}(SO_3({\mathbb {R}})),\,\,\,J_f=D\Big \}\subset {\mathscr {D}}_3({\mathbb {R}}), \end{aligned}$$

where ${\mathcal {P}}(SO_3({\mathbb {R}}))$ is the set of probability measures on $SO_3({\mathbb {R}})$. The set ${{\,\mathrm{diag}\,}}^{-1}(\Omega )\subset {\mathbb {R}}^3$ is exactly the tetrahedron ${\mathscr {T}}$ defined as the convex hull of the points $(\pm 1,\pm 1,\pm 1)$ with an even number of minuses (which we will call Horn’s tetrahedron). It is a consequence of Horn’s theorem (Horn 1954, Theorem 8) which states that ${\mathscr {T}}$ is exactly the set of vectors which are the diagonal of an element of $SO_3({\mathbb {R}})$. It ensures that if f is a probability measure, we have by convexity of ${\mathscr {T}}$:

$$\begin{aligned} \int _{SO_3({\mathbb {R}})} f(A) A\,\mathrm{d}A\in {{\,\mathrm{diag}\,}}({\mathscr {T}}) \end{aligned}$$

and therefore ${{\,\mathrm{diag}\,}}^{-1}(\Omega )\subset {\mathscr {T}}$. Conversely, taking the Dirac deltas $\delta _{I_3}$ and similarly for the other vertices of ${\mathscr {T}}$, we see that the four vertices of Horn’s tetrahedron belong to ${{\,\mathrm{diag}\,}}^{-1}(\Omega )$. Since $\Omega $ is convex, we conclude that ${\mathscr {T}}\subset {{\,\mathrm{diag}\,}}^{-1}(\Omega )$.

The diagonal solutions of the matrix compatibility equation (18) satisfy the following necessary condition.

Proposition 4.2

The diagonal solutions of the compatibility equation (18) are necessarily of one of the following types:

(a)
$D=0$.
(b)
$D=\alpha {{\,\mathrm{diag}\,}}(\pm 1,\pm 1,\pm 1)$ with an even number of minus signs and where $\alpha \in {\mathbb {R}}\setminus \{0\}$.

If $\alpha \in (0,+\infty )$, the diagonal part of the SSVD of these diagonal matrices is equal to $D=\alpha I_3$.

If $\alpha \in (-\infty ,0)$, the diagonal part of the SSVD of these diagonal matrices is equal to $D=\alpha {{\,\mathrm{diag}\,}}(-1,-1,1)=|\alpha |{{\,\mathrm{diag}\,}}(1,1,-1)$.
(c)
$D=\alpha {{\,\mathrm{diag}\,}}(\pm 1,0,0)$ and the matrices obtained by permutation of the diagonal coefficients and where $\alpha \in {\mathbb {R}}\setminus \{0\}$.

The diagonal part of the SSVD of these diagonal matrices is equal to $D={{\,\mathrm{diag}\,}}(|\alpha |,0,0)$.

Section 4.2 will be devoted to the proof of this proposition. We are now ready to prove Theorem 5.

Proof of Theorem 5

An equilibria of the BGK equation is of the form

$$\begin{aligned} f=\rho M_J, \end{aligned}$$

where

$$\begin{aligned} J=J_f =\rho \langle A\rangle _{M_J}. \end{aligned}$$

It is straightforward to check that $J=0$ is a solution of (18). Now, let D a matrix of one, the form described in Proposition 4.2 with a parameter $\alpha \in {\mathbb {R}}$. For instance, for a matrix of type (c) like $D=\alpha {{\,\mathrm{diag}\,}}(0,-1,0)$, thanks to Lemma 4.1 we have:

$$\begin{aligned} D=\rho \langle A\rangle _{M_D}\,\Longleftrightarrow \,D=\rho c_2(\alpha ) {{\,\mathrm{diag}\,}}(0,-1,0)\,\Longleftrightarrow \,\alpha =\rho c_2(\alpha ). \end{aligned}$$

Similarly, for the other diagonal matrices of type (c), we prove that they are solution of the matrix compatibility equation (18) if and only if their parameter $\alpha \in {\mathbb {R}}$ is solution of the scalar compatibility equation (20). Analogously, one can check that the diagonal matrices of type (b) are solutions of the matrix compatibility equation (18) if and only if their parameters $\alpha \in {\mathbb {R}}$ are solutions of the scalar compatibility equation (19). This yields all the diagonal solutions of (18). Now, the solutions of (18) are exactly the matrices $J\in {{\,\mathrm{Orb}\,}}(D)$ where D is a diagonal solution of (18) and the set ${{\,\mathrm{Orb}\,}}(D)\subset {\mathscr {M}}_3({\mathbb {R}})$ is the orbit of D defined in the introduction. We conclude by noticing that if D is of type (b) then ${{\,\mathrm{Orb}\,}}(D)= SO_3({\mathbb {R}})$ and if D is of type (c) then ${{\,\mathrm{Orb}\,}}(D)={\mathscr {B}}$. $\square $

Remark 4.8

When applied to diagonal matrices, the last part of Theorem 5 states that the diagonal solutions of (18) are necessarily of one of the types (a), (b) or (c) defined in Proposition 4.2 and that, it holds that

1.
the matrix 0 is always a solution of (18),
2.
a matrix of type (b) is a solution of (18) iff its parameter $\alpha \in {\mathbb {R}}\setminus \{0\}$ satisfies (19),
3.
a matrix of type (c) is a solution of (18) iff its parameter $\alpha \in {\mathbb {R}}\setminus \{0\}$ satisfies (20).

4.2 Proof of Proposition 4.2

The proof of Proposition 4.2 is based on two results. The first one has been proved in (Wang and Hoffman 2008, Section 4) to study the nematic alignment of polymers in higher-dimensional spaces:

Theorem 6

(Wang and Hoffman 2008). Let $n\ge 3$, $b\in {\mathbb {R}}_+$ and ${\mathbf {s}}=(s_1,s_2,\ldots ,s_n)\in {\mathbb {R}}^n$ a solution of the nonlinear system

$$\begin{aligned} s_j=\langle m_j^2\rangle _{g_{{\mathbf {s}},b}},\,\,\,\,\,j=1,\ldots ,n, \end{aligned}$$

(22)

where the average is taken with respect to the PDF on the sphere ${\mathbb {S}}^{n-1}$:

$$\begin{aligned} g_{{\mathbf {s}},b}(m_1,\ldots ,m_n):=\frac{1}{Z}\exp \left( b\sum _{j=1}^{n} s_j m_j^2\right) , \end{aligned}$$

(23)

where Z is the normalisation constant which ensures that $g_{{\mathbf {s}},b}$ is a PDF on the sphere ${\mathbb {S}}^{n-1}$. Then, ${{\,\mathrm{Card}\,}}\{s_1,s_2,\ldots ,s_n\}\le 2$.

The second tool that we will use to prove Proposition 4.2 is an isomorphism between $SO_3({\mathbb {R}})$ and the space of unitary quaternions which transforms the compatibility equation (21) into the compatibility equation (22) studied in Theorem 6.

Proposition 4.3

1.
There is an isomorphism between the group $SO_3({\mathbb {R}})$ and the quotient group ${\mathbb {H}}/\pm 1$, where ${\mathbb {H}}$ is the group of unit quaternions. Since ${\mathbb {H}}$ is homeomorphic to ${\mathbb {S}}^3$, there is an isomorphism $\Phi $:
$$\begin{aligned} \Phi : {\mathbb {S}}^3/\pm 1 \longrightarrow SO_3({\mathbb {R}}). \end{aligned}$$
Moreover, $\Phi $ is an isometry in the sense that it maps the volume form of ${\mathbb {S}}^3/\pm 1$ (defined as the image measure of the usual measure on ${\mathbb {S}}^3$ by the projection on the quotient space) to the volume form on $SO_3({\mathbb {R}})$: for all measurable function f on $SO_3({\mathbb {R}})$,
$$\begin{aligned} \int _{{\mathbb {S}}^3/\pm 1} f\big (\Phi (q)\big )\,dq = \int _{SO_3({\mathbb {R}})} f(A)\,\mathrm{d}A. \end{aligned}$$
2.
There is a linear isomorphism between the vector space ${\mathscr {M}}_3({\mathbb {R}})$ and the vector space ${\mathscr {S}}_4^0({\mathbb {R}})$ of trace free symmetric matrices of dimension 4:
$$\begin{aligned} \phi ~: {\mathscr {M}}_3({\mathbb {R}}) \longrightarrow {\mathscr {S}}_4^0({\mathbb {R}}), \end{aligned}$$
such that for all $J\in {\mathscr {M}}_3({\mathbb {R}})$, and $q\in {\mathbb {H}}/\pm 1$,
$$\begin{aligned} \frac{1}{2} J\cdot \Phi (q) = q\cdot \phi (J)q. \end{aligned}$$
The first dot product is defined by Eq. (7), and the second one is the usual dot product in ${\mathbb {R}}^4$.
3.
For all $q\in {\mathbb {H}}/\pm 1$, it holds that $\phi \big (\Phi (q)\big )=q\otimes q-\frac{1}{4}I_4$.
4.
The isomorphism $\phi $ preserves the diagonal structure: if $D={{\,\mathrm{diag}\,}}(d_1,d_2,d_3)$, then
$$\begin{aligned} \phi (D)=\frac{1}{4}\left( \begin{array}{cccc} d_1+d_2+d_3 &{} 0 &{} 0 &{} 0 \\ 0 &{} d_1-d_2-d_3 &{} 0 &{} 0 \\ 0 &{} 0 &{} -d_1+d_2-d_3 &{} \\ &{} 0 &{} 0 &{} -d_1-d_2+d_3 \end{array}\right) \end{aligned}$$
and if $Q={{\,\mathrm{diag}\,}}(s_1,s_2,s_3,s_4)$ with $s_1+s_2+s_3+s_4=0$, then
$$\begin{aligned} \phi ^{-1}(Q)=2\left( \begin{array}{ccc} {s_1+s_2} &{} 0 &{} 0\\ 0 &{} {s_1+s_3} &{} 0 \\ 0 &{} 0 &{} {s_1+s_4} \end{array} \right) . \end{aligned}$$

The proof of this proposition can be found in Appendix A. We are now ready to prove Proposition 4.2.

Proof of Proposition 4.2

Using the first and second points of Proposition 4.3, it holds that

$$\begin{aligned} \int _{SO_3({\mathbb {R}})} A e^{A\cdot D}\,\mathrm{d}A = \int _{{\mathbb {S}}^3/\pm 1} \Phi (q)e^{\Phi (q)\cdot D}\,dq = \int _{{\mathbb {S}}^{3}/\pm 1}\Phi (q)e^{2q\cdot \phi (D)q}\,dq. \end{aligned}$$

The compatibility equation (21) then becomes:

$$\begin{aligned} D=\frac{\rho }{Z}\int _{{\mathbb {S}}^{3}/\pm 1}\Phi (q)e^{2q\cdot \phi (D)q}\,dq. \end{aligned}$$

Applying the isomorphism $\phi $ defined in Proposition 4.3 to this last equation, we can exchange $\phi $ and the integral by linearity and we obtain thanks to the third point of Proposition 4.3:

$$\begin{aligned} \phi (D)=\frac{\rho }{Z}\int _{{\mathbb {S}}^{3}/\pm 1}\phi \big (\Phi (q)\big )e^{2q\cdot \phi (D)q}\,dq=\frac{\rho }{Z}\int _{{\mathbb {S}}^{3}/\pm 1}\left( q\otimes q-\frac{1}{4}I_4\right) e^{2q\cdot \phi (D)q}\,dq. \end{aligned}$$

Using the fourth point of Proposition 4.3, we then obtain the following equivalent problem: find all the trace-free diagonal matrices $Q={{\,\mathrm{diag}\,}}(s_1,s_2,s_3,s_4)$ of dimension 4 such that

$$\begin{aligned} Q=\rho \frac{\int _{{\mathbb {S}}^3/\pm 1} e^{\sum _{i=1}^{4} 2s_iq_i^2}(q\otimes q-\frac{1}{4} I_4)\,dq}{Z}, \end{aligned}$$

where Z is a normalisation constant:

$$\begin{aligned} Z:=\int _{{\mathbb {S}}^3/\pm 1} e^{\sum _{i=1}^{4} 2s_iq_i^2}\,dq. \end{aligned}$$

Equivalently, defining for $i\in \{1,2,3,4\}$:

$$\begin{aligned} s_i':=\frac{s_i}{\rho }+\frac{1}{4}, \end{aligned}$$

we want to solve the system of compatibility equations:

$$\begin{aligned} s_i' = \int _{{\mathbb {S}}^3/\pm 1} q_i^2g_{{\mathbf {s}}',2\rho }(q)\,dq,\,\,\,\,\,\,\,i=1,2,3,4, \end{aligned}$$

(24)

where ${\mathbf {s}}'=(s_1',s_2',s_3',s_4')$ and $g_{{\mathbf {s}}',2\rho }$ is given by (23). Thanks to Theorem 6, we conclude that if ${\mathbf {s}}'$ is a solution of (24), then the coefficients $s_1',s_2',s_3',s_4'$ can take at most two distinct values. So, the same result holds for the coefficients $s_1,s_2,s_3,s_4$. Now thanks to the fourth point of Proposition 4.3, we only have the following possibilities:

if $s_1=s_2=s_3=s_4=0$, then
$$\begin{aligned} D=\phi ^{-1}(Q)=0, \end{aligned}$$
if $s_1=3\alpha /4$ and $s_2=s_3=s_4=-\alpha /4$ for $\alpha \in {\mathbb {R}}$, then
$$\begin{aligned} D=\phi ^{-1}(Q)=\alpha I_3, \end{aligned}$$
if $s_2=3\alpha /4$ and $s_1=s_3=s_4=-\alpha /4$ for $\alpha \in {\mathbb {R}}$, then
$$\begin{aligned} D=\phi ^{-1}(Q)=\alpha \left( \begin{array}{ccc}1 &{} 0 &{} 0 \\ 0 &{} -1 &{} 0 \\ 0 &{} 0 &{} -1 \end{array}\right) , \end{aligned}$$
and similarly by permuting the diagonal elements when $s_3=3\alpha /4$ and when the other elements are equal $s_1=s_2=s_4=-\alpha /4$ or when $s_4=3\alpha /4$ and $s_1=s_2=s_3=-\alpha /4$,
if $s_1=s_2=\alpha /4$ and $s_3=s_4=-\alpha /4$ for $\alpha \in {\mathbb {R}}$, then
$$\begin{aligned} D=\phi ^{-1}(Q)=\alpha \left( \begin{array}{ccc}1 &{} 0 &{} 0 \\ 0 &{} 0 &{} 0 \\ 0 &{} 0 &{} 0 \end{array}\right) , \end{aligned}$$
and similarly by permuting the diagonal elements when $s_1=s_3=\alpha /4$ and when $s_2=s_4=-\alpha /4$ or $s_1=s_4=\alpha /4$ and $s_2=s_3=-\alpha /4$.

The computation of the SSVD for these matrices is an easy computation. This concludes the proof of Proposition 4.2.$\square $

4.3 Determination of the Equilibria for Each Density $\rho $

In Theorem 5 we saw that the BGK operator can have three types of equilibria. The uniform equilibria $f=\rho $ (corresponding to $J=0$) is always an equilibrium. However, the existence of the other two types of equilibria depends on Eqs. (19) and (20) having a solution for a given $\rho $. Therefore, the existence of these types of equilibria will depend on the value of $\rho $. In this section we will determine the existing equilibria for each value of $\rho $. In particular, we will draw the phase diagram for $\rho $ and $\alpha $, that is to say the parametrised curves defined by Eqs. (19) and (20) in the plane $(\rho ,\alpha )$ (see Fig. 2). We first prove the following proposition.

Proposition 4.4

Let $\rho _c:=6$.

(i)
The function $\alpha \mapsto \alpha /c_1(\alpha )$ is well defined on ${\mathbb {R}}$, and its value at zero is $\rho _c$. Moreover, there exists $\alpha ^*>0$ such that this function is decreasing on $(-\infty ,\alpha ^*]$ and increasing on $[\alpha ^*,+\infty )$. Defining $\rho ^*:=\alpha ^*/c_1(\alpha ^*)$, it holds that $\rho ^*<\rho _c$.
(ii)
The function $\alpha \mapsto \alpha /c_2(\alpha )$ is even. It is decreasing on $(-\infty ,0)$, increasing on $(0,\infty )$ and its value at zero is $\rho _c$.
(iii)
We have the following asymptotic behaviours:
$$\begin{aligned}&\frac{\alpha }{c_1(\alpha )}\underset{\alpha \rightarrow +\infty }{\sim }\alpha +1,\\&\frac{\alpha }{c_2(\alpha )}\underset{\alpha \rightarrow +\infty }{\sim }\alpha +2. \end{aligned}$$

Proof

The idea of the proof is taken from Wang and Hoffman (2008).

(i)
Since
$$\begin{aligned} \frac{\mathrm{d}}{\mathrm{d}\theta }\Big \{\sin ^2(\theta /2)\sin \theta \Big \}=\sin ^2(\theta /2)(1+2\cos \theta ), \end{aligned}$$
an integration by parts shows that:
$$\begin{aligned} \frac{\alpha }{c_1(\alpha )}=3\frac{\int _0^\pi \sin ^2(\theta /2)e^{\alpha \cos \theta }\,\mathrm{d}\theta }{\int _0^\pi \sin ^2(\theta /2)\sin ^2\theta e^{\alpha \cos \theta }\,\mathrm{d}\theta }=\frac{3}{\{ \sin ^2\theta \}_\alpha }. \end{aligned}$$
It proves that $\alpha $ and $c_1(\alpha )$ have the same sign for all $\alpha \in {\mathbb {R}}$. Then, we define function $m{:}\,\alpha \mapsto \{ \sin ^2\theta \}_\alpha =3 c_1(\alpha )/\alpha $ which satisfies the property:
$$\begin{aligned} m'(\alpha )=0\,\,\Longrightarrow m''(\alpha )<0, \end{aligned}$$
since
$$\begin{aligned} m'(\alpha )=\{ \sin ^2\theta \cos \theta \}_\alpha -\{\sin ^2\theta \}_\alpha \{ \cos \theta \}_\alpha , \end{aligned}$$
and
$$\begin{aligned} m''(\alpha )=-{{\,\mathrm{Var}\,}}_\alpha (\cos ^2\theta )-2\{\cos \theta \}_\alpha m'(\alpha ), \end{aligned}$$
where ${{\,\mathrm{Var}\,}}_\alpha $ is the variance for the probability density (14). This property implies that $\alpha /c_1(\alpha )$ has only one critical point which is a global minimum. This minimum is attained at a point $\alpha ^*>0$ as a simple computation shows that $m'(0)>0$ and consequently $\rho ^*<\rho _c$. A simple computation gives $m(0)=\frac{1}{2}$ so $\rho _c=6$.
(ii)
We have similarly:
$$\begin{aligned} \frac{\alpha }{c_2(\alpha )} = 4\frac{\int _0^\pi \sin \varphi e^{\frac{\alpha }{2}\cos \varphi }\,\mathrm{d}\varphi }{\int _0^\pi \sin ^3\varphi e^{\frac{\alpha }{2}\cos \varphi }\,\mathrm{d}\varphi }=\frac{4}{[\sin ^2\varphi ]_\alpha }, \end{aligned}$$
(25)
from which we can easily see that $\alpha \mapsto \alpha /c_2(\alpha )$ is even and has only one minimum attained at $\alpha =0$. A simple computation shows that its value at 0 is $\rho _c=6$.
(iii)
The behaviour at infinity is obtained by Laplace’s method: with the change of variable $s=1-\cos \theta $ on $[0,\pi ]$, we get
$$\begin{aligned} \frac{\alpha }{c_1(\alpha )}=3\frac{e^{\alpha }\int _0^2 e^{-\alpha s}\frac{s}{2\sqrt{1-(1-s)^2}}\,\mathrm{d}s}{e^\alpha \int _0^2 e^{-\alpha s}\frac{s}{2}\sqrt{1-(1-s)^2}\,\mathrm{d}s}\underset{\alpha \rightarrow +\infty }{\sim }\alpha +1. \end{aligned}$$
With the same method we have:
$$\begin{aligned} \frac{\alpha }{c_2(\alpha )}=4\frac{\int _0^2 e^{-\frac{\alpha }{2}s}\,\mathrm{d}s}{\int _0^2 e^{-\frac{\alpha }{2}s}(1-(1-s)^2)\,\mathrm{d}s}\underset{\alpha \rightarrow +\infty }{\sim }\alpha +2. \end{aligned}$$

$\square $

Thanks to Proposition 4.4 and Theorem 5, we can now fully describe the equilibria of the BGK operator. A graphical representation of this result is given by the phase diagram depicted in Fig. 2:

Corollary 4.2

(Equilibria of the BGK operator, depending on the density $\rho $) The set of equilibria of the BGK operator (12) depends on the value of $\rho $. In particular, we need to distinguish three regions $\rho \in (0,\rho ^*)$, $\rho \in (\rho ^*,\rho _c)$ and $\rho >\rho _c$ where $\rho ^*$ and $\rho _c$ are defined in Proposition 4.4. We have the following equilibria in each region:

For $0<\rho <\rho ^*$, $\alpha =0$ is the unique solution of Eqs. (19) and (20) and therefore the only equilibrium is the uniform equilibrium $f^{\mathrm {eq}}=\rho $.
For $\rho =\rho ^*$, in addition to the uniform equilibrium, there is a family of anisotropic equilibria given by $f^{\mathrm {eq}}=\rho ^* M_{\alpha ^* \Lambda }$ where $\Lambda \in SO_3({\mathbb {R}})$ and $\alpha ^*=\rho ^* c_1(\alpha ^*)$.
For $\rho ^*<\rho <\rho _c$, the compatibility equation (19) has two solutions $\alpha _+$ and $\alpha _-$ with $0<\alpha _-<\alpha _+$ which give, in addition to the uniform equilibrium, two families of anisotropic equilibria: $f^{\mathrm {eq}}=\rho M_{\alpha _+\Lambda }$ and $f^{\mathrm {eq}}=\rho M_{\alpha _-\Lambda }$ with $\Lambda \in SO_3({\mathbb {R}})$.
For $\rho =\rho _c$, we have $\alpha _-=0$.
For $\rho >\rho _c$, Eq. (19) has two solutions $\alpha _3<0<\alpha _1$ which give two families of anisotropic equilibria $f^{\mathrm {eq}}=\rho M_{\alpha _3\Lambda }$ and $f^{\mathrm {eq}}=\rho M_{\alpha _1\Lambda }$ with $\Lambda \in SO_3({\mathbb {R}})$. Moreover, Eq. (20) has two solutions $-\alpha _2<0<\alpha _2$ which give another family of equilibria: $f^{\mathrm {eq}}=\rho M_{\alpha _2 B}$ where $B\in {\mathscr {B}}$. The uniform equilibrium is always an equilibrium.

When an equilibrium is of the form $f^{\mathrm {eq}}=\rho M_{\alpha \Lambda }$ with parameters $\alpha >0$ and $\Lambda \in SO_3({\mathbb {R}})$, then these parameters can, respectively, be interpreted as a concentration parameter and a mean body orientation. They are analogous to the equilibria found in Degond et al. (2015) in the Vicsek case. However, in $SO_3({\mathbb {R}})$, there exist other equilibria which are not of this form. We will see in Sect. 5 that these latter equilibria are always unstable.

Finally, the following picture (Fig. 3) is a representation in the space ${\mathbb {R}}^3$ of the diagonal parts of the SSVDs of the solution of the matrix compatibility equation (18) when $\rho >\rho _c$. They all belong to the domain ${\mathscr {D}}$ defined by (11) and depicted in orange in Fig. 3.

5 Convergence to Equilibria

Now that we know all the equilibria of the spatially homogeneous BGK equation (6); we proceed to investigate the asymptotic behaviour of f(t, A) as $t\rightarrow +\infty $. This problem can be reduced to looking at the asymptotic behaviour of $J_f$ since, if $J_f\rightarrow J_\infty \in {\mathscr {M}}_3({\mathbb {R}})$, then f(t) will converge as $t\rightarrow +\infty $ towards $\rho M_{J_\infty }$ as it can be seen by writing Duhamel’s formula for Eq. (6):

$$\begin{aligned} f(t)=e^{-t}f_0+\rho \int _0^t e^{-(t-s)}M_{J_{f(s)}}\,\mathrm{d}s. \end{aligned}$$

(26)

The asymptotic behaviour of $J_f$ is much simpler than the one of f since $J_f$ is the solution of the following ODE

$$\begin{aligned} \frac{\mathrm{d}}{\mathrm{d}t}J_f=\rho \langle A\rangle _{M_{J_f}}-J_f,\,\,\,\,\,\,J_f(t=0)=J_{f_0}, \end{aligned}$$

(27)

as it can be seen by multiplying (6) by $A\in SO_3({\mathbb {R}})$ and integrating over $SO_3({\mathbb {R}})$. Since $J\in {\mathscr {M}}_3({\mathbb {R}})\mapsto M_J\in L^\infty (SO_3({\mathbb {R}}))$ is locally Lipschitz, the flow of Eq. (27) is defined globally in time since the map $J\mapsto \rho \langle A\rangle _{M_J}$ is Lipschitz with bounded Lipschitz seminorm.

Notice that the solutions of the compatibility equation (18) are exactly the equilibria of the dynamical system (27). We therefore obtain the following proposition:

Proposition 5.1

(Equilibria of the BGK operator, equilibria of the ODE (27)) A distribution $f^{\mathrm {eq}}=\rho M_J$ is an equilibrium of the BGK operator (12) if and only if $J\in {\mathscr {M}}_3({\mathbb {R}})$ is an equilibrium of the dynamical system (27).

We will call stable/unstable an equilibrium of the BGK operator (12) such that the associated matrix $J\in {\mathscr {M}}_3({\mathbb {R}})$ is a stable/unstable equilibrium of the ODE (27). This section is devoted to the proof of the following theorem:

Theorem 7

(Convergence towards equilibria). Let $\rho \in {\mathbb {R}}_+$ be such that $\rho \ne \rho ^*$ and $\rho ~\ne ~\rho _c$ (as defined in Proposition 4.4). Let $f_0$ be an initial condition for (6) and let $J_{f_0}~=~PD_0Q$ be a SSVD. Let f(t) be the solution at time $t\in {\mathbb {R}}_+$ of the spatially homogeneous BGK equation (6) with initial condition $f_0$. Let D(t) be the solution at time $t\in {\mathbb {R}}_+$ of the ODE (27) with initial condition $D_0\in {\mathscr {D}}$. It holds that:

$$\begin{aligned} J_{f(t)}=PD(t)Q \end{aligned}$$

is a SSVD and there exists a subset ${\mathscr {N}}_\rho \subset {\mathbb {R}}^3$ of zero Lebesgue measure such that, writing ${\mathcal {N}}_\rho :={{\,\mathrm{diag}\,}}({\mathscr {N}}_\rho )\subset {\mathscr {M}}_3({\mathbb {R}})$:

1.
if $D_0\notin {\mathcal {N}}_\rho $, then f(t) converges as $t\rightarrow \infty $ towards an equilibrium $f^{\mathrm {eq}}$ of the BGK operator (12) of the form $f^{\mathrm {eq}}=\rho M_{J^{\mathrm {eq}}}$, where $J^{\mathrm {eq}}\in {\mathscr {M}}_3({\mathbb {R}})$ is of one of the forms described in Theorem 5. The convergence is locally exponentially fast in the sense that there exists constants $\delta ,K,\mu >0$ such that if $\Vert J_{f_0}-J^{\mathrm {eq}}\Vert \le \delta $ then for all $t>0$,
$$\begin{aligned} \forall A\in SO_3({\mathbb {R}}),\,\,\,|f(t,A)-f^{\mathrm {eq}}(A)|\le e^{-\mu t}\Big (K\rho +|f_0(A)-f^{\mathrm {eq}}(A)|\Big ). \end{aligned}$$
2.
If $D_0\notin {\mathcal {N}}_\rho $, we have the following asymptotic behaviours depending on the density $\rho $:
1. (i)
  if $0<\rho <\rho ^*$, then $D(t)\rightarrow 0$ as $t\rightarrow +\infty $ and, consequently, $f^{\mathrm {eq}}=\rho $,
2. (ii)
  if $\rho ^*<\rho <\rho _c$, then $D(t)\rightarrow 0$ or $D(t)\rightarrow \alpha _+ I_3$ as $t\rightarrow +\infty $ and, consequently, $f^{\mathrm {eq}}=\rho $ or $f^{\mathrm {eq}}=\rho M_{\alpha _+\Lambda }$, respectively, where $\alpha _+>0$ is defined in Corollary 4.2 and $\Lambda :=PQ\in SO_3({\mathbb {R}})$,
3. (iii)
  if $\rho >\rho _c$, then $D(t)\rightarrow \alpha _1 I_3$ as $t\rightarrow +\infty $ and, consequently, $f^{\mathrm {eq}}=\rho M_{\alpha _1\Lambda }$ where $\alpha _1>0$ is defined in Corollary 4.2 and $\Lambda :=PQ\in SO_3({\mathbb {R}})$.

Remark 5.1

The subset ${\mathscr {N}}_\rho \subset {\mathbb {R}}^3$ depends on the density $\rho $. This subset will be made explicit in the three cases $\rho <\rho ^*$, $\rho ^*<\rho <\rho _c$ and $\rho >\rho _c$ in Sect. 5.3. If $D_0\in {\mathcal {N}}_\rho $, then there is convergence towards an unstable equilibrium at a rate which may not be exponential.

Remark 5.2

(Phase transitions) Theorem 7 demonstrates a phase transition phenomenon triggered by the density of agents $\rho $: when $\rho <\rho ^*$, the system is disordered (asymptotically in time) in the sense that $J_f\rightarrow 0$ and we therefore cannot define a mean body-attitude. When the density increases and exceeds the critical value $\rho _c$, the system is self-organised (asymptotically in time and for almost every initial data), in the sense that $J_f\rightarrow \alpha \Lambda $ where $\alpha \in {\mathbb {R}}_+$ and $\Lambda \in SO_3({\mathbb {R}})$ can be, respectively, interpreted as a concentration parameter and a mean body attitude. When $\rho ^*<\rho <\rho _c$, the self-organised and disordered states are both asymptotically stable and the convergence towards one or the other state depends on the initial data. Such “transition region” also appears in the Vicsek model, as studied in Degond et al. (2015), and gives rise to an hysteresis phenomenon.

The proof of this theorem can be found at the end of Sect. 5.2. It is based on a gradient flow structure for the flux $J_f$ studied in Sect. 5.1. This structure ensures the convergence of $J_f$ towards a matrix $J^{\mathrm {eq}}\in {\mathscr {M}}_3({\mathbb {R}})$ as $t\rightarrow +\infty $ and consequently the convergence of f(t) as $t\rightarrow +\infty $ towards an equilibrium. The stability of the equilibria determines which equilibrium can be attained. This question is addressed in Sect. 5.2. Additional details about the subset ${\mathscr {N}}_\rho $ as well as the study of the critical cases $\rho =\rho ^*$ and $\rho =\rho _c$ are provided in Sect. 5.3.

5.1 A Gradient-Flow Structure in $\pmb {{\mathbb {(R)}}}^3$

In this section we show that the ODE (27) can be reduced to a gradient-flow ODE in ${\mathbb {R}}^3$. We first show (Sect. 5.1.1) how (27) can be reduced to an ODE in ${\mathbb {R}}^3$, the equilibria of which are linked to the equilibria of (27) [and therefore of (6)]. Then, we show that this ODE in ${\mathbb {R}}^3$ has a gradient-flow structure which will allow us to conclude on the asymptotic behaviour of the solution of (6) (Sect. 5.1.2).

5.1.1 Reduction to a Nonlinear ODE in ${\mathbb {R}}^3$ and Equilibria

The ODE (27) is a matrix-valued nonlinear ODE (in dimension 9) but, as in the previous section (Proposition 4.1 and Corollary 4.1), we will use the left and right invariance of the Haar measure and the SSVD to reduce the problem to a vector-valued nonlinear ODE in dimension 3, as explained in Proposition 5.2 and Corollary 5.1.

Proposition 5.2

(Reduction to a nonlinear ODE in dimension 3) Let $J_0\in {\mathscr {M}}_3({\mathbb {R}})$ be a given matrix and let $D_0\in {{\,\mathrm{Orb}\,}}(J_0)$ be diagonal. Let $P,Q\in SO_3({\mathbb {R}})$ such that $J_0=PD_0Q$. Let $J:[0,\infty )\rightarrow {\mathscr {M}}_3({\mathbb {R}})$ be a $C^1$ curve in ${\mathscr {M}}_3({\mathbb {R}})$ with $J(0)=J_0$. For all $t>0$, let $D(t)\in {\mathscr {M}}_3({\mathbb {R}})$ be the matrix such that $J(t)=PD(t)Q$. It holds that:

(i)
$J=J(t)$ is the solution of the ODE (27) with initial condition $J(t=0)=J_0$ if and only if $D=D(t)$ is the solution of the same ODE (27) with initial condition $D(t=0)=D_0$.
(ii)
Moreover, if (i) holds, then the matrix $D(t)\in {\mathscr {M}}_3({\mathbb {R}})$ is diagonal for all time.

Proof

(i)
Using the left and right invariance of the Haar measure, we see that if the matrix $J(t)\in {\mathscr {M}}_3({\mathbb {R}})$ is a solution of (27), then for any $P,Q\in SO_3({\mathbb {R}})$, PJ(t)Q is also a solution (and conversely).
(ii)
Since $D_0$ is diagonal, the fact that D(t) is also diagonal is a consequence of Lemma 3.2 which states that $\langle A\rangle _{M_D}$ is diagonal when D is diagonal.

$\square $

For any matrix $J_0$, such a diagonal matrix $D_0$ always exists: we can take the diagonal part of the SSVD of $J_0$. We therefore only have to study the following ODE for diagonal matrices ${\mathscr {D}}_3({\mathbb {R}})$. Since this vector space is isomorphic to ${\mathbb {R}}^3$ through the isomorphism ${{\,\mathrm{diag}\,}}$ defined in the introduction, we obtain two equivalent ODEs:

$$\begin{aligned} \frac{\mathrm{d}}{\mathrm{d}t}D(t)&=\rho \langle A\rangle _{M_{D(t)}}-D(t),\,\,\,\,\,\,D(t=0)=D_0, \end{aligned}$$

(28a)

$$\begin{aligned} \frac{\mathrm{d}}{\mathrm{d}t}{\widehat{D}}(t)&=\rho {{\,\mathrm{diag}\,}}^{-1}\left( \langle A\rangle _{M_{{{\,\mathrm{diag}\,}}({\widehat{D}}(t))}}\right) -{\widehat{D}}(t),\,\,\,\,\,\,{\widehat{D}}(t=0)={\widehat{D}}_0, \end{aligned}$$

(28b)

where ${\widehat{D}}_0={{\,\mathrm{diag}\,}}^{-1}(D_0)$ and the following equivalence holds: ${\widehat{D}}(t)={{\,\mathrm{diag}\,}}^{-1}(D(t))$ is the solution of (28b) if and only if $D(t)={{\,\mathrm{diag}\,}}({\widehat{D}}(t))$ is the solution of (28a).

Note that it is not clear that if $J_0=PD_0 Q$ is a SSVD, then $J(t)=PD(t)Q$ is a SSVD for all $t>0$. The following proposition and corollary ensure that the SSVD is preserved by the dynamical system which will allow us to restrict the domain on which the ODE (28a) is posed.

Proposition 5.3

(Invariant manifolds). The following subsets of ${\mathbb {R}}^3$ are invariant manifolds of the dynamical system (28b):

the planes $\big \{(d_1,d_2,d_3)\in {\mathbb {R}}^3,\,\,d_i+d_j=0\big \}$ for $i\ne j\in \{1,2,3\}$,
the planes $\big \{(d_1,d_2,d_3)\in {\mathbb {R}}^3,\,\,d_i-d_j=0\big \}$ for $i\ne j\in \{1,2,3\}$,
the intersections of two of these planes and in particular the lines
$$\begin{aligned} {\mathbb {R}}\left( \begin{array}{c} 1\\ 1\\ 1 \end{array}\right) ,\,\,\,\,\, {\mathbb {R}}\left( \begin{array}{c}1\\ 0\\ 0 \end{array}\right) ,\,\,\,\,\,{\mathbb {R}}\left( \begin{array}{c} 1\\ 1\\ -1 \end{array}\right) \ldots \end{aligned}$$

Proof

For $i=2$ and $j=3$, the result has already been proved in the second point of Lemma 4.1. The other cases are similar.$\square $

Corollary 5.1

Let $J_0\in {\mathscr {M}}_3({\mathbb {R}})$ and $J_0=PD_0Q$ be a SSVD with $P,Q\in SO_3({\mathbb {R}})$ and $D_0\in {\mathscr {D}}_3({\mathbb {R}})$ diagonal. Let J(t) be the solution of the ODE (27) with initial condition $J(t=0)=J_0$. Let D(t) the solution of the ODE (28a) with initial condition $D(t=0)=D_0$. Then, the decomposition $J(t)=PD(t)Q$ is a SSVD for J(t).

Proof

The fact that $J(t)=PD(t)Q$ is a consequence of the first point of Proposition 5.2. The fact that it is a SSVD is a consequence of Proposition 5.3 which ensures that the conditions (11) remain true for all $t>0$: ${\mathscr {D}}$ is stable in the sense that if $D_0\in {\mathscr {D}}$, then $D(t)\in {\mathscr {D}}$ for all time $t>0$. This follows from the fact that the image by the isomorphism ${{\,\mathrm{diag}\,}}$ of the invariant manifolds of (28b) described in Proposition 5.3 are invariant manifolds of the dynamical system (28a). These manifolds in ${\mathscr {D}}_3({\mathbb {R}})$ form the boundary of the subset ${\mathscr {D}}$.$\square $

In conclusion, the study of the asymptotic behaviour of f(t) as $t\rightarrow +\infty $ can be reduced to the study of the asymptotic behaviour of the solutions of the ODE (28a) posed on the domain ${\mathscr {D}}$ (see Fig. 3).

The following Proposition describes the equilibria of the dynamical system (28a) and is a consequence of the results of Sect. 4.

Proposition 5.4

(Equilibria of the dynamical system (28a)). The equilibria of the dynamical system (28a) are, depending on the density $\rho $:

the matrix $D=0$ for any density $\rho $,
the diagonal matrices of type (b) with parameters $\alpha _+$ and $\alpha _-$ when $\rho ^*<\rho <\rho _c$,
the diagonal matrices of type (b) with parameters $\alpha _1$ and $\alpha _3$ and the diagonal matrices of type (c) with parameters $\pm \alpha _2$ when $\rho >\rho _c$,

where the types (b) and (c) are defined in Proposition 4.2 and the elements $\alpha _+$, $\alpha _-$, $\alpha _1$, $\alpha _2$ and $\alpha _3$ are defined in Corollary 4.2.

Proof

The equilibria of (28a) are the diagonal matrices D such that

$$\begin{aligned} \rho \langle A\rangle _{M_D}-D=0, \end{aligned}$$

which is exactly Eq. (18) for the diagonal matrices. This equation has been solved in Theorem 5, Remark 4.8 and Corollary 4.2. $\square $

Remark 5.3

As in the previous section, we have found all the diagonal equilibria of (28a). However, thanks to Corollary 5.1, only the ones which belong to ${\mathscr {D}}$ are needed.

The following proposition is a straightforward consequence of the previous results.

Proposition 5.5

(Equilibria of the BGK operator, equilibria of the ODE (28a)). Let $J\in ~{\mathscr {M}}_3({\mathbb {R}})$ with a SSVD given by $J=PDQ$. The following assertions are equivalent.

(1)
The distribution $f^{\mathrm {eq}}=\rho M_J$ is an equilibrium of the BGK operator (12).
(2)
The matrix D is an equilibrium of the dynamical system (28a) on the domain ${\mathscr {D}}$.
(3)
The matrix D is of one of the forms:
- $D=0$, when $\rho <\rho ^*$,
- $D=0$ or $D=\alpha _- I_3$ or $D=\alpha _+ I_3$, when $\rho ^*\le \rho \le \rho _c$,
- $D=0$ or $D=\alpha _1 I_3$ or $D=\alpha _3 {{\,\mathrm{diag}\,}}(-1,-1,1)$ or $D=\alpha _2{{\,\mathrm{diag}\,}}(1,0,0)$, when $\rho >\rho _c$,
where $\rho ^*$, $\rho _c$, $\alpha _-$, $\alpha _+$, $\alpha _1$, $\alpha _2$ and $\alpha _3$ are defined in Proposition 4.4 and Corollary 4.2.

5.1.2 A Gradient-Flow Structure

Lemma 5.1

(Gradient-flow structure) We define the partition function of a matrix $J~\in ~{\mathscr {M}}_3({\mathbb {R}})$:

$$\begin{aligned} {\mathcal {Z}}(J){:}\,=\int _{SO_3({\mathbb {R}})} e^{J\cdot A}\,\mathrm{d}A, \end{aligned}$$

(29)

and the potentials V(J) and ${\widehat{V}}({\widehat{D}})$, respectively, on ${\mathscr {M}}_3({\mathbb {R}})$ and on ${\mathbb {R}}^3$:

$$\begin{aligned} V(J)&:=\frac{1}{2}\Vert J\Vert ^2-\rho \log {\mathcal {Z}}(J), \end{aligned}$$

(30a)

$$\begin{aligned} {\widehat{V}}({\widehat{D}})&:=\frac{1}{2}|{\widehat{D}}|^2-2\rho \log {\mathcal {Z}}({{\,\mathrm{diag}\,}}({\widehat{D}})) \end{aligned}$$

(30b)

where $\Vert \cdot \Vert $ and $|\cdot |$ are, respectively, the norm on ${\mathscr {M}}_3({\mathbb {R}})$ induced by the inner product (7) and the Euclidean norm on ${\mathbb {R}}^3$. Then, we can rewrite equations (28a) and (28b) into a gradient flow structure as follows:

$$\begin{aligned} \frac{\mathrm{d}}{\mathrm{d}t}D(t)&=\rho \langle A\rangle _{M_{D(t)}}-D(t)=-\nabla V(D), \end{aligned}$$

(31a)

$$\begin{aligned} \frac{\mathrm{d}}{\mathrm{d}t}{\widehat{D}}(t)&=\rho {{\,\mathrm{diag}\,}}^{-1}\left( \langle A\rangle _{M_{{{\,\mathrm{diag}\,}}({\widehat{D}}(t))}}\right) -{\widehat{D}}(t)=-\nabla {\widehat{V}}({\widehat{D}}), \end{aligned}$$

(31b)

where $\nabla $ is the gradient operator in ${\mathscr {M}}_3({\mathbb {R}})$ endowed with the Riemannian structure (7) or the gradient operator in ${\mathbb {R}}^3$ endowed with the usual Euclidean structure.

Proof

The partition function satisfies that for all $J\in {\mathscr {M}}_3({\mathbb {R}})$,

$$\begin{aligned} \nabla (\log {\mathcal {Z}})(J)=\langle A\rangle _{M_J}, \end{aligned}$$

since $\nabla (e^{J\cdot A})=Ae^{J\cdot A}$. The result follows in ${\mathscr {M}}_3({\mathbb {R}})$. The result in ${\mathbb {R}}^3$ follows from the fact that for any $w_1,w_2\in {\mathbb {R}}^3$, it holds that:

$$\begin{aligned} w_1\cdot w_2=2{{\,\mathrm{diag}\,}}(w_1)\cdot {{\,\mathrm{diag}\,}}(w_2) \end{aligned}$$

where $\cdot $ denotes the dot product on ${\mathbb {R}}^3$ and on ${\mathscr {M}}_3({\mathbb {R}})$ as defined in (7). $\square $

Remark 5.4

This gradient-flow structure on $J_f$ is specific to the BGK equation and does not hold for the Fokker–Planck operator (as shown in Degond et al. 2015, the differential equation satisfied by $J_f$ in the Vicsek case involves the spherical harmonics of degree 2 and higher of f; here the equation for $J_f$ is closed).

As an application of the gradient-flow structure, we can prove that all the trajectories of (27) and (31a) are bounded: since the potential V in (30a) is decreasing along the trajectories, we can write for all $t>0$

$$\begin{aligned} \frac{1}{2}\Vert J(t)\Vert ^2 \le V(J_0)+\rho \log {\mathcal {Z}}(J(t)). \end{aligned}$$

Using the fact that $SO_3({\mathbb {R}})$ is compact with volume one, by the mean-value theorem applied to (29) there exists ${\bar{A}}\in SO_3({\mathbb {R}})$ such that ${\mathcal {Z}}(J(t))\le e^{{\bar{A}}\cdot J(t)}$. Therefore, owing to the fact that $\Vert A\Vert \le \sqrt{3/2}$ for all $A\in SO_3({\mathbb {R}})$, we obtain using the Cauchy–Schwarz inequality that $\log {\mathcal {Z}}(J(t))\le \sqrt{\frac{3}{2}}\Vert J(t)\Vert $. Then, by Young’s inequality we get:

$$\begin{aligned} \frac{1}{4}\Vert J(t)\Vert ^2\le V(J_0)+\frac{3}{2}\rho ^2. \end{aligned}$$

Moreover, since the potential V is analytic, a classical result by Haraux (2012), Łojasiewicz (1984) implies that the solution to (31a) and (27) will converge towards an equilibrium. The same holds for the system (31b).

When all the equilibria of the dynamical system (28a) are hyperbolic the convergence towards the stable equilibria is exponentially fast [see (Hirsch et al. 2012, Section 9.3) and the Hartman–Grobman theorem (Perko 2013, Section 2.8)]. The goal of the next section is to find which equilibria among the ones found in Sect. 4 are stable and to completely describe the asymptotic behaviour of the system depending on the initial condition and the local density $\rho $. We will see that phase transitions appear between ordered and disordered dynamics when the density $\rho $ increases.

5.2 Stability of the Equilibria and Conclusion

Since the flow of (28b) is the image by the isomorphism ${{\,\mathrm{diag}\,}}^{-1}$ of the flow of (28a), an equilibria $D^{\mathrm {eq}}$ of (28a) is stable (resp. unstable) if and only if ${{\,\mathrm{diag}\,}}^{-1}(D^{\mathrm {eq}})$ is a stable (resp. unstable) equilibria of (28b). The stability properties of the equilibria of (28b) are much simpler to study than the stability properties of the equilibria of (28a) since they are given by the signature of the Hessian matrix ${{\,\mathrm{Hess}\,}}{\widehat{V}}({\widehat{D}}^{\mathrm {eq}})\in {\mathscr {S}}_3({\mathbb {R}})$ of the potential ${\widehat{V}}$ given in Eq. (30b) (the linearisation of ODE (28b) around an equilibrium ${\widehat{D}}^{\mathrm {eq}}$ is indeed $\frac{\mathrm{d}}{\mathrm{d}t}{\widehat{H}}=-{{\,\mathrm{Hess}\,}}{\widehat{V}}({\widehat{D}}^{\mathrm {eq}}){\widehat{H}}$). In particular, an equilibrium ${\widehat{D}}^{\mathrm {eq}}$ is stable if and only if the signature of ${{\,\mathrm{Hess}\,}}{\widehat{V}}({\widehat{D}}^{\mathrm {eq}})$ is $(+++)$.

Note that in the matrix framework (31a), the Hessian of the potential (30a) in the Euclidean space ${\mathscr {M}}_3({\mathbb {R}})$ would be a rank 4 tensor. Here we are reduced to the computation of the signature of $3\times 3$ symmetric matrices. For a diagonal matrix $D\in {\mathscr {D}}_3({\mathbb {R}})$ and ${\widehat{D}}={{\,\mathrm{diag}\,}}^{-1}(D)$ we will write with a slight abuse of notations:

$$\begin{aligned} {{\,\mathrm{Hess}\,}}V(D)\equiv {{\,\mathrm{Hess}\,}}{\widehat{V}}({\widehat{D}}). \end{aligned}$$

When $D\in {\mathscr {D}}_3({\mathbb {R}})$ is an equilibrium of (28a), we call signature of the Hessian matrix ${{\,\mathrm{Hess}\,}}V(D)$ the signature of ${{\,\mathrm{Hess}\,}}{\widehat{V}}({\widehat{D}})$ where ${\widehat{D}}={{\,\mathrm{diag}\,}}^{-1}(D)$. A simple computation shows that the Hessian matrix ${{\,\mathrm{Hess}\,}}V(D)$ is given by:

$$\begin{aligned} {{\,\mathrm{Hess}\,}}V(D)=I_3-\frac{1}{2}\rho \Gamma ^D, \end{aligned}$$

(32)

where $\Gamma ^D=(\Gamma ^D_{ij})_{i,j}$ with:

$$\begin{aligned} \Gamma ^D_{ij}=\langle a_{ii} a_{jj}\rangle _{M_D}-\langle a_{ii}\rangle _{M_D}\langle a_{jj}\rangle _{M_D}. \end{aligned}$$

The following theorem is an extension of Corollary 4.2 and gives a full description of the equilibria of the BGK operator with their stability.

Theorem 8

(Stability of the equilibria of the ODE (28a))

For $0<\rho <\rho ^*$, the only equilibrium is $D=0$. This equilibrium is stable.
For $\rho ^*<\rho <\rho _c$, the equilibrium $D=0$ and the equilibria of type (b) with parameter $\alpha _+$ are stable. The equilibria of type (b) with parameter $\alpha _-$ are unstable and the signature of the Hessian matrix is $(-++)$.
For $\rho >\rho _c$, the stable equilibria are the equilibria of type (b) with parameter $\alpha _1$. The other equilibria ($D=0$, type (b) with parameter $\alpha _3$ and type (c) with parameter $\pm \alpha _2$) are unstable and the signatures of the Hessian matrix are, respectively, $(---)$, $(+--)$ and $(++-)$.

The proof of Theorem 8 will be based on the following lemma which states an important orbital invariance principle for the signature of the Hessian matrix.

Lemma 5.2

(Orbital invariance of the signature) Let $G_D$ and $G_P$ the subgroups of $SO_3({\mathbb {R}})$, respectively, generated by the matrices $D^{ij}$ and $P^{ij}$ defined in Definition 3.1. These subgroups act on ${\mathscr {D}}_3({\mathbb {R}})$, respectively, by left multiplication and by conjugation. Let $D\in {\mathscr {D}}_3({\mathbb {R}})$ be a diagonal matrix. Then, the signature of ${{\,\mathrm{Hess}\,}}V(D)$ is invariant by the action of $G_D$ and $G_P$ on D.

Proof

Let us first introduce the map $\pi ^{{{\,\mathrm{diag}\,}}}{:}\,{\mathscr {M}}_3({\mathbb {R}})\rightarrow {\mathbb {R}}^3, M\mapsto (M_{ii})_i$. Let $L\in G_D$ and $P\in G_P$. The map $\pi ^{{{\,\mathrm{diag}\,}}}$ satisfies for all $M\in {\mathscr {M}}_3({\mathbb {R}})$:

$$\begin{aligned} \pi ^{{{\,\mathrm{diag}\,}}}(LM)=L\pi ^{{{\,\mathrm{diag}\,}}}(M),\,\,\,\text {and}\,\,\,\pi ^{{{\,\mathrm{diag}\,}}}(PMP^\mathrm{T}) = R\pi ^{{{\,\mathrm{diag}\,}}}(M) \end{aligned}$$

(33)

where $R:=P\odot P\in {\mathcal {O}}_3({\mathbb {R}})$ is the Hadamard product (or element-wise product) of P with itself (that is R is the permutation matrix obtained by removing all the minuses in P). Let ${\widehat{H}}\in {\mathbb {R}}^3$ and $H:={{\,\mathrm{diag}\,}}({\widehat{H}})\in {\mathscr {D}}_3({\mathbb {R}})$. From the expression (32) of the Hessian matrix ${{\,\mathrm{Hess}\,}}V(D)\equiv {{\,\mathrm{Hess}\,}}{\widehat{V}}({\widehat{D}})$, it follows that:

$$\begin{aligned} {{\,\mathrm{Hess}\,}}V(D){\widehat{H}} = {\widehat{H}} - \rho \left( \left\langle (A\cdot H) \pi ^{{{\,\mathrm{diag}\,}}}(A)\right\rangle _{M_D}-\langle A\cdot H\rangle _{M_D}\langle \pi ^{{{\,\mathrm{diag}\,}}}(A)\rangle _{M_D}\right) . \end{aligned}$$

Using the left and right invariance of the Haar measure and (33), we obtain:

$$\begin{aligned} {{\,\mathrm{Hess}\,}}V(LD){\widehat{H}}&= {\widehat{H}} - \rho L\left( \left\langle (A\cdot L^\mathrm{T}H) \pi ^{{{\,\mathrm{diag}\,}}}(A)\right\rangle _{M_D}-\langle A\cdot L^\mathrm{T}H\rangle _{M_D}\langle \pi ^{{{\,\mathrm{diag}\,}}}(A)\rangle _{M_D}\right) \\&= L{{\,\mathrm{Hess}\,}}V(D){{\,\mathrm{diag}\,}}^{-1}(L^\mathrm{T}H) \end{aligned}$$

and therefore, since ${{\,\mathrm{diag}\,}}^{-1}(L^\mathrm{T} H) = L^\mathrm{T}{\widehat{H}}$, it follows that

$$\begin{aligned} {{\,\mathrm{Hess}\,}}V(LD) = L{{\,\mathrm{Hess}\,}}V(D)L^\mathrm{T}. \end{aligned}$$

Similarly, using (33), we obtain

$$\begin{aligned} {{\,\mathrm{Hess}\,}}V(PDP^\mathrm{T}){\widehat{H}} = {\widehat{H}} - \rho R\left( \left\langle (A\cdot P^\mathrm{T}HP) \pi ^{{{\,\mathrm{diag}\,}}}(A)\right\rangle _{M_D}-\langle A\cdot P^\mathrm{T}HP\rangle _{M_D}\langle \pi ^{{{\,\mathrm{diag}\,}}}(A)\rangle _{M_D}\right) \end{aligned}$$

and since ${{\,\mathrm{diag}\,}}^{-1}(P^\mathrm{T}HP) = R^\mathrm{T}{\widehat{H}}$, it follows that

$$\begin{aligned} {{\,\mathrm{Hess}\,}}V(PDP^\mathrm{T}) = R{{\,\mathrm{Hess}\,}}V(D)R^\mathrm{T}. \end{aligned}$$

The conclusion follows from Sylvester’s law of inertia (Lax 2007, Chapter 8 Theorem 1). $\square $

If ${\widehat{D}}$ is an equilibrium of (28b), then ${{\,\mathrm{diag}\,}}({\widehat{D}})$ is of one type (a), (b) or (c) defined in Proposition 4.2. For the types (b) and (c), the diagonal matrix ${{\,\mathrm{diag}\,}}({\widehat{D}})$ is therefore in the orbit of, respectively, $\alpha I_3$ (by the action of $G_D$) or $\alpha {{\,\mathrm{diag}\,}}(1,0,0)$ (by the action of $G_P$) where $\alpha $ solves the associated compatibility equation. As a consequence of Lemma 5.2, one only needs to look at the signature of the Hessian matrix of ${\widehat{V}}$ taken in one of these two representatives. We are now ready to prove Theorem 8.

Proof of Theorem 8

Thanks to Lemma 5.2, we therefore do not have to compute the signatures of the Hessian matrix taken at all equilibria, and it is enough to choose one matrix in each orbit: for the equilibria of type (b) (see Proposition 4.2) we will compute the signature of ${{\,\mathrm{Hess}\,}}V(\alpha I_3)$ where $\alpha =\rho c_1(\alpha )$ and for the equilibria of type (c) we will compute the signature of ${{\,\mathrm{Hess}\,}}V(\alpha {{\,\mathrm{diag}\,}}(1,0,0))$ where $\alpha =\rho c_2(\alpha )$. We first start with the case of the uniform equilibrium.

Case 1. Uniform equilibrium $D=0$

For the uniform equilibrium $D=0$, $\langle a_{ij}\rangle =0$ for all (i, j). Moreover, by the change of variable $A'= D^{ik}A$ where $k\ne i,j$, we have when $i\ne j$:

$$\begin{aligned} \langle a_{ii}a_{jj}\rangle =-\langle a_{ii}a_{jj}\rangle =0. \end{aligned}$$

It proves that $\Gamma $ is diagonal. Then, with the changes of variables $A'=P^{ij}A$ or $A'=AP^{ij}$ it can be seen that all the $3^2$ quantities $\langle a_{ij}^2\rangle $ are equal. Since their sum is equal to $n=3$, we get that

$$\begin{aligned} {{\,\mathrm{Hess}\,}}V(0)=\left( 1-\frac{1}{2}\rho \langle a_{11}^2\rangle \right) I_3= \left( 1-\frac{\rho }{\rho _c}\right) I_3, \end{aligned}$$

where $\rho _c=6$. In conclusion, the signature of ${{\,\mathrm{Hess}\,}}V(0)$ is $(+++)$ if $\rho <\rho _c$ and $(---)$ if $\rho >\rho _c$.

Case 2. Equilibria of type (b) $D=\alpha I_3$

Let $D=\alpha I_3$ with $\alpha =\rho c_1(\alpha )$. We have $c_1(\alpha )=\langle a_{11}\rangle _{M_D}=\langle a_{22}\rangle _{M_D}=\langle a_{33}\rangle _{M_D}$ and a change of variable of the type $A'=P^{ij}A(P^{ij})^\mathrm{T}$ shows that all the $\langle a_{kk} a_{\ell \ell }\rangle _{M_D}$ are equal. The Hessian matrix is therefore equal to:

$$\begin{aligned} {{\,\mathrm{Hess}\,}}V(\alpha I_3)=I_3-\frac{1}{2}\rho \left( \begin{array}{ccc} \nu &{} \gamma &{} \gamma \\ \gamma &{} \nu &{} \gamma \\ \gamma &{} \gamma &{}\nu \end{array}\right) , \end{aligned}$$

with $\nu =\langle a_{11}^2\rangle _{M_D}-\langle a_{11}\rangle _{M_D}^2$ and $\gamma =\langle a_{11}a_{22}\rangle _{M_D}-\langle a_{11}\rangle _{M_D}\langle a_{22}\rangle _{M_D}$. The eigenvalues of ${{\,\mathrm{Hess}\,}}V(D)$ are:

$1-\frac{1}{2}\rho \nu -\rho \gamma $ of order 1 with eigenvector $(1,1,1)^\mathrm{T}$. But taking the derivative with respect to $\alpha $ of (13) with $\Lambda =I_3$, we obtain the relation:
$$\begin{aligned} c_1'(\alpha )=\frac{1}{2}\langle a_{11}^2\rangle _{M_D}+\langle a_{11}a_{22}\rangle _{M_D}-\frac{3}{2}\langle a_{11}\rangle _{M_D}^2=\frac{1}{2}\nu +\gamma , \end{aligned}$$
and using $\rho =\alpha /c_1(\alpha )$ we can rewrite:
$$\begin{aligned} 1-\frac{1}{2}\rho \nu -\rho \gamma =1-\frac{\alpha c_1'(\alpha )}{c_1(\alpha )}=c_1(\alpha )\left( \frac{id}{c_1}\right) '(\alpha ). \end{aligned}$$
Its sign is then given by Proposition 4.4 and the fact that $c_1(\alpha )$ has the same sign as $\alpha $: $c_1(\alpha )\left( \frac{id}{c_1}\right) '(\alpha )>0$ when $\alpha <0$, $c_1(\alpha )\left( \frac{id}{c_1}\right) '(\alpha )<0$ when $0\le \alpha <\alpha ^*$ and $c_1(\alpha )\left( \frac{id}{c_1}\right) '(\alpha )>0$ when $\alpha >\alpha ^*$.
$1-\frac{1}{2}\rho \nu +\frac{1}{2}\rho \gamma $ of order 2 with eigenvectors $(1,-1,0)^\mathrm{T}$ and $(0,1,-1)^\mathrm{T}$. It can be rewritten as:
$$\begin{aligned} 1-\frac{1}{2}\rho \nu +\frac{1}{2}\rho \gamma =1-\frac{\alpha }{4c_1(\alpha )}\big \langle (a_{11}-a_{22})^2\big \rangle _{M_D}. \end{aligned}$$
To determine this sign, we use the explicit volume form of $SO_3({\mathbb {R}})$ given by Rodrigues’ formula (8) to see that :
$$\begin{aligned} \frac{\alpha }{4c_1(\alpha )}\Big \langle (a_{11}-a_{22})^2\Big \rangle _{M_D} = \frac{\alpha }{5}\cdot \frac{\int _0^\pi \sin ^2(\theta /2)(1-\cos \theta )^2e^{\alpha \cos \theta }\,\mathrm{d}\theta }{\int _0^\pi \sin ^2(\theta /2)(1+2\cos \theta )e^{\alpha \cos \theta }\,\mathrm{d}\theta }. \end{aligned}$$

Lemma 5.3

The function

$$\begin{aligned} f:x\longmapsto 1- \frac{x}{5}\cdot \frac{\int _0^\pi \sin ^2(\theta /2)(1-\cos \theta )^2e^{x\cos \theta }\,\mathrm{d}\theta }{\int _0^\pi \sin ^2(\theta /2)(1+2\cos \theta )e^{x\cos \theta }\,\mathrm{d}\theta }, \end{aligned}$$

satisfies $f(0)=0$, $f(x)\ge 0$ if $x\ge 0$ and $f(x)\le 0$ if $x\le 0$

Proof

The value of f(0) is given by the expansion of $\exp $. Note that:

$$\begin{aligned} \frac{\mathrm{d}}{\mathrm{d}\theta }\Big \{\sin ^2(\theta /2)\sin \theta \Big \}=\sin ^2(\theta /2)(1+2\cos \theta ), \end{aligned}$$

so that an integration by parts shows:

$$\begin{aligned} \int _0^\pi \sin ^2(\theta /2)(1+2\cos \theta )e^{x\cos \theta }\,\mathrm{d}\theta =x\int _0^\pi \sin ^2(\theta /2)\sin ^2(\theta )e^{x\cos \theta }\mathrm{d}\theta . \end{aligned}$$

We get that:

$$\begin{aligned} f(x)=1-\frac{1}{5}\frac{\int _0^\pi \sin ^2(\theta /2)(1-\cos \theta )^2e^{x\cos \theta }\,\mathrm{d}\theta }{\int _0^\pi \sin ^2(\theta /2)\sin ^2(\theta )e^{x\cos \theta }\mathrm{d}\theta }. \end{aligned}$$

We have:

$$\begin{aligned} f(x)\ge 0\Longleftrightarrow \int _0^\pi \sin ^2(\theta /2) \left( \frac{1}{5}(1-\cos \theta )^2-\sin ^2(\theta )\right) e^{x\cos \theta }\,\mathrm{d}\theta \le 0. \end{aligned}$$

Linearising $\sin ^2(\theta /2)$ and expanding everything gives:

$$\begin{aligned} \sin ^2(\theta /2) \left( \frac{1}{5}(1-\cos \theta )^2-\sin ^2(\theta )\right) =\frac{-1}{5}(3\cos \theta +2)(1-\cos \theta )^2, \end{aligned}$$

so that:

$$\begin{aligned} f(x)\ge 0\Longleftrightarrow \int _0^\pi (3\cos \theta +2)(1-\cos \theta )^2 e^{x\cos \theta }\,\mathrm{d}\theta \ge 0. \end{aligned}$$

Now for $x\ge 0$, let $\theta _0=\arccos (-2/3)$. We cut the integral at $\theta _0$ and we get

$$\begin{aligned} \int _0^{\theta _0} (3\cos \theta +2)(1-\cos \theta )^2 e^{x\cos \theta }\,\mathrm{d}\theta \ge e^{-\frac{2}{3}x}\int _0^{\theta _0} (3\cos \theta +2)(1-\cos \theta )^2\,\mathrm{d}\theta , \end{aligned}$$

since the integrand is non-negative and $x\ge 0$. Similarly, when the integrand is non-positive

$$\begin{aligned} \int _{\theta _0}^\pi (3\cos \theta +2)(1-\cos \theta )^2 e^{x\cos \theta }\,\mathrm{d}\theta \ge e^{-\frac{2}{3}x}\int _{\theta _0}^\pi (3\cos \theta +2)(1-\cos \theta )^2 \,\mathrm{d}\theta . \end{aligned}$$

And finally:

$$\begin{aligned} \int _0^\pi (3\cos \theta +2)(1-\cos \theta )^2 e^{x\cos \theta }\,\mathrm{d}\theta \ge e^{-\frac{2}{3}x}\int _0^\pi (3\cos \theta +2)(1-\cos \theta )^2\,\mathrm{d}\theta =0. \end{aligned}$$

We find similarly that $f(x)\le 0$ when $x\le 0$. $\square $

And therefore, we can deduce the sign of the eigenvalue: $1-\frac{\alpha }{4c_1(\alpha )}\big \langle (a_{11}-a_{22})^2\big \rangle _{M_D}>0$ when $\alpha >0$ and $1-\frac{\alpha }{4c_1(\alpha )}\Big \langle (a_{11}-a_{22})^2\big \rangle _{M_D}<0$ when $\alpha <0$.

Case 3. Equilibria of type (c) $D=\alpha {{\,\mathrm{diag}\,}}(1,0,0)$

For $D=\alpha {{\,\mathrm{diag}\,}}(1,0,0)$ with $\alpha =\rho c_2(\alpha )$, using the parametrisation (10), it holds that:

$$\begin{aligned}&\int _{SO_3({\mathbb {R}})}a_{22}e^{\frac{\alpha }{2}a_{11}}\,\mathrm{d}A=\int _{SO_3({\mathbb {R}})}a_{22}e^{\frac{\alpha }{2}a_{33}}\,\mathrm{d}A\\&\quad =\frac{1}{8\pi ^2}\int _{\theta =0}^{2\pi }\int _{\phi _2=0}^\pi e^{\frac{\alpha }{2}\cos \phi _2}\sin \phi _2\int _{\phi _1=0}^{2\pi }(-\sin \theta \sin \phi _1+\cos \theta \cos \phi _1\cos \phi _2)\mathrm{d}\phi _1\mathrm{d}\phi _2\mathrm{d}\theta \\&\quad =0. \end{aligned}$$

where the first equality comes from the change of variable $A\mapsto P^{13}A(P^{13})^\mathrm{T}$. Similarly,

$$\begin{aligned} \langle a_{22}\rangle _{M_D}=\langle a_{33}\rangle _{M_D}=\langle a_{11} a_{22}\rangle _{M_D}=\langle a_{11} a_{33}\rangle _{M_D}=0. \end{aligned}$$

The Hessian matrix is therefore equal to:

$$\begin{aligned} I_3-\frac{1}{2}\rho \left( \begin{array}{ccc} \langle a_{11}^2\rangle _{M_D}-\langle a_{11}\rangle _{M_D}^2 &{}0&{}0 \\ 0 &{} \langle a_{22}^2\rangle _{M_D} &{} \langle a_{22}a_{33}\rangle _{M_D} \\ 0 &{} \langle a_{22}a_{33}\rangle _{M_D} &{} \langle a_{33}^2\rangle _{M_D} \end{array}\right) , \end{aligned}$$

and since $\langle a_{22}^2\rangle _{M_D}=\langle a_{33}^2\rangle _{M_D}$ as it can be seen with the change of variable $A\mapsto P^{23}A(P^{23})^\mathrm{T}$, its eigenvalues are:

$$\begin{aligned} 1-\frac{1}{2}\rho \Big (\langle a_{11}^2\rangle _{M_D}-\langle a_{11}\rangle _{M_D}^2\Big ), \end{aligned}$$

with eigenvector $(1,0,0)^\mathrm{T}$,

$$\begin{aligned} 1-\frac{1}{2}\rho \Big ( \langle a_{22}^2\rangle _{M_D}- \langle a_{22}a_{33}\rangle _{M_D}\Big ), \end{aligned}$$

with eigenvector $(0,1,-1)^\mathrm{T}$ and

$$\begin{aligned} 1-\frac{1}{2}\rho \Big ( \langle a_{22}^2\rangle _{M_D}+ \langle a_{22}a_{33}\rangle _{M_D}\Big ), \end{aligned}$$

with eigenvector $(0,1,1)^\mathrm{T}$. We have as before:

$$\begin{aligned} c_2'(\alpha )=\frac{1}{2}\Big (\langle a_{11}^2\rangle _{M_D}-\langle a_{11}\rangle _{M_D}^2\Big ), \end{aligned}$$

so the first eigenvalue can be rewritten

$$\begin{aligned} 1-\frac{1}{2}\rho \Big (\langle a_{11}^2\rangle _{M_D}-\langle a_{11}\rangle _{M_D}^2\Big )=1-\frac{\alpha c_2'(\alpha )}{c_2(\alpha )}=c_2(\alpha )\left( \frac{id}{c_2}\right) '(\alpha )>0, \end{aligned}$$

where we have used Proposition 4.4 to determine the sign. The two other eigenvalues are equal to:

$$\begin{aligned} 1-\frac{1}{4}\rho \Big \langle (a_{22}\pm a_{33})^2\Big \rangle _{M_D} \end{aligned}$$

Using the change of variable $A\mapsto P^{13}A(P^{13})^\mathrm{T}$ and the parametrisation (10), one can see that:

$$\begin{aligned}&\int _{SO_3({\mathbb {R}})} (a_{22}\pm a_{33})^2 e^{\frac{\alpha }{2}a_{11}}\,\mathrm{d}A=\int _{SO_3({\mathbb {R}})} (a_{22}\pm a_{11})^2 e^{\frac{\alpha }{2}a_{33}}\,\mathrm{d}A\\&\quad =\frac{1}{8\pi ^2}\int _{\theta =0}^{2\pi }\int _{\phi _1=0}^{2\pi }\cos ^2(\phi _1\pm \theta )\int _{\phi _2=0}^{\pi }\sin \phi _2\,(1\pm \cos \phi _2)^2 e^{\frac{\alpha }{2}\cos \phi _2}\,\mathrm{d}\phi _2\,\mathrm{d}\phi _1\,\mathrm{d}\theta \end{aligned}$$

so that:

$$\begin{aligned} \Big \langle (a_{22}\pm a_{33})^2\Big \rangle _{M_D}=\frac{1}{2}\Big [(1\pm \cos \phi )^2\Big ]_{\alpha } \end{aligned}$$

where $[\cdot ]_\alpha $ is defined in Proposition 4.1. Using the relation $\rho =\frac{\alpha }{c_2(\alpha )}=\frac{4}{[ \sin ^2\phi ]_\alpha }$ [see Formula (25)], the two eigenvalues are equal to:

$$\begin{aligned} 1-\frac{1}{2}\frac{\int _0^\pi \sin \phi (1\pm \cos \phi )^2e^{\frac{\alpha }{2}\cos \phi }\,\mathrm{d}\phi }{\int _0^\pi \sin ^3\phi e^{\frac{\alpha }{2}\cos \phi }\,\mathrm{d}\phi }. \end{aligned}$$

With the same technique used in the previous paragraph (Lemma 5.3), it is possible to show that the eigenvalue $1-\frac{1}{4}\rho \Big \langle (a_{22}- a_{33})^2\Big \rangle _{M_D}$ is non-positive when $\alpha <0$ and non-negative when $\alpha >0$. The contrary holds for the other eigenvalue (non-negative when $\alpha <0$ and non-positive when $\alpha >0$). Finally, the signs of these two eigenvalues are always $(+-)$.

The conclusion of the proof follows from the study of the roots of Eqs. (19) and (20) provided by Corollary 4.2 and depicted in Fig. 2. In particular, the cases $\alpha =\alpha ^*$ and $\alpha =0$ correspond to the critical cases $\rho =\rho ^*$ and $\rho =\rho _c$ which are studied in the next section. $\square $

Remark 5.5

The above calculations are similar to the computation of the equilibria and their stability of

$$\begin{aligned} \Psi {:}\, D\in {{\,\mathrm{diag}\,}}({\mathscr {T}})\mapsto {\mathcal {F}}[M_D] \end{aligned}$$

where ${\mathcal {F}}$ is the free-energy (5). In particular, it can be shown that the equilibria of $\Psi $ and their stability are the same as the ones described in Theorem 8. In particular, since (5) is also a free energy in the Fokker–Planck case, this analysis shows the instability of equilibria of the Fokker–Planck of the form $\rho M_{D}$ when D is of one of the unstable equilibria of (28a) described in Theorem 8. This technique is similar to the one which was used in Zhou and Wang (2011) in the case of 3D polymers. However, it does not provide global or local convergence of the solution of the Fokker–Planck equation towards an equilibrium. This requires further investigations which will be left for future work. In the Vicsek case (Degond et al. 2015), it was based on a LaSalle’s principle and on estimates for the dissipation term.

We can finally prove Theorem 7:

Proof of Theorem 7

Thanks to the Duhamel’s formula (26), the asymptotic behaviour of f(t) as $t\rightarrow +\infty $ is given by the asymptotic behaviour of $J_{f(t)}$. Thanks to Proposition 5.2 and Corollary 5.1, we only have to study the asymptotic behaviour of the solution of the ODE (28a) where the initial condition is the diagonal part of a SSVD of $J_{f_0}$. Since this equation has a gradient-flow structure (31a), by analyticity of the potential and boundedness of the trajectories, Łojasiewicz inequality (Haraux 2012; Łojasiewicz 1984) implies that the solution D(t) converges as $t\rightarrow +\infty $ towards an equilibrium $D^{\mathrm {eq}}$ and consequently $J_{f(t)}\rightarrow PD^{\mathrm {eq}}Q:=J^{\mathrm {eq}}$. Moreover the equilibrium $D^{\mathrm {eq}}$ is a stable equilibrium provided that $D_0$ does not belong to the stable manifold of an unstable equilibrium. Since these manifolds have dimension at most 2, the union of these manifolds, called ${\mathcal {N}}_\rho $ is of zero measure and there is convergence towards a stable equilibrium for all $D_0\notin {\mathcal {N}}_\rho $. In this case, since all the equilibria are hyperbolic, the Hartman–Grobman theorem (Perko 2013, Section 2.8) ensures that the convergence is locally exponentially fast in the sense that there exist constants $\delta ,\lambda ,C>0$ such that if $\Vert J_{f_0}-J^{\mathrm {eq}}\Vert \le \delta $ then for all $t>0$,

$$\begin{aligned} \Vert J_{f(t)}-J^{\mathrm {eq}}\Vert \le Ce^{-\lambda t}. \end{aligned}$$

(34)

Let $f^{\mathrm {eq}}=\rho M_{J^{\mathrm {eq}}}$. It follows from Duhamel’s formula (26) that for all $A\in SO_3({\mathbb {R}})$:

$$\begin{aligned} |f(t,A)-f^{\mathrm {eq}}(A)|\le e^{-t}|f_0(A)-f^{\mathrm {eq}}(A)|+\rho e^{-t}\int _0^t e^s|M_{J_{f(s)}}(A)-M_{J^{\mathrm {eq}}}(A)|\mathrm{d}s.\nonumber \\ \end{aligned}$$

(35)

Since $SO_3({\mathbb {R}})$ is compact and $J_{f(t)}$ is bounded uniformly in t, there exists a constant $L>0$ such that for all $t>0$, the following Lipschitz bound holds:

$$\begin{aligned} \forall A\in SO_3({\mathbb {R}}),\,\,\,|M_{J_{f(t)}}(A)-M_{J^{\mathrm {eq}}}(A)|\le L\Vert J_{f(t)}-J^{\mathrm {eq}}\Vert . \end{aligned}$$

(36)

Reporting (36) into (35) and using (34), the first point of Theorem 7 follows with constants $K=CL/|1-\lambda |$ and $\mu =\min (1,\lambda )$ when $\lambda \ne 1$ and $K=CL$ and $\mu =1-\varepsilon $ for any $\varepsilon >0$ when $\lambda =1$.

The stability of the equilibria of the dynamical system (28a) is given in Theorem 8. Finally the conclusions of points 2.(ii) and 2.(iii) follow from the fact that the diagonal parts of the SSVD of the equilibria of type (b) with parameters $\alpha _+$ and $\alpha _1$ are, respectively, $\alpha _+ I_3$ and $\alpha _1 I_3$.$\square $

5.3 Final Remark: Characterisation of the Stable Manifold of the Unstable Equilibria and Critical Cases

This section is devoted to a more precise description of the subset ${\mathscr {N}}_\rho ={{\,\mathrm{diag}\,}}^{-1}({\mathcal {N}}_\rho )$ of zero measure of initial conditions which do not necessarily lead to one of the behaviours detailed in the previous section (see Theorem 7).

When $\rho \ne \rho ^*$ and $\rho \ne \rho _c$, all the equilibria of (28b) are hyperbolic and we can apply the stable manifold theorem (Perko 2013, Section 2.7) which states that for a given hyperbolic equilibrium ${\widehat{D}}^{\mathrm {eq}}$ of the nonlinear equation (28b), the stable set of ${\widehat{D}}^{\mathrm {eq}}$ is a smooth manifold. Its dimension is given by the number of minuses in the signature of $-{{\,\mathrm{Hess}\,}}{\widehat{V}}({\widehat{D}}^{\mathrm {eq}})$, and its tangent space at ${\widehat{D}}^{\mathrm {eq}}$ is the stable subspace of the linearised system around ${\widehat{D}}^{\mathrm {eq}}$. We are now ready to describe the behaviour of the system (28b) for a given density $\rho $ and an initial condition ${\widehat{D}}_0\in {\mathbb {R}}^3$. Note that we can restrict ourselves to the case ${\widehat{D}}_0\in {{\,\mathrm{diag}\,}}^{-1}({\mathscr {D}}_3({\mathbb {R}}))$ and that our goal is to describe ${\mathscr {N}}_\rho ={{\,\mathrm{diag}\,}}^{-1}({\mathcal {N}}_\rho )$ (in the ${\mathbb {R}}^3$-framework). We write $D(t)={{\,\mathrm{diag}\,}}{\widehat{D}}(t)$ the solution of (28a) with initial condition $D_0={{\,\mathrm{diag}\,}}{\widehat{D}}_0$.

Case 1. When $\rho <\rho ^*$

When $\rho <\rho ^*$, there is only the uniform equilibrium: $D(t)\underset{t\rightarrow +\infty }{\longrightarrow }0$ and ${\mathscr {N}}_\rho =\emptyset $.

Case 2. When $\rho ^*<\rho <\rho _c$

There are three equilibria for (28b): two are stable (0 and $\alpha _+ (1,1,1)^\mathrm{T}$) and one is unstable ($\alpha _-(1,1,1)^\mathrm{T}$) which satisfies that $-{{\,\mathrm{Hess}\,}}({\widehat{V}})(\alpha _- (1,1,1)^\mathrm{T})$ has signature $(--+)$. The stable manifold theorem ensures that ${\mathscr {N}}_\rho $ is the stable set of the unstable equilibrium and is a smooth manifold of dimension 2 and its tangent plane at $\alpha _-(1,1,1)^\mathrm{T}$ admits the eigenvectors $(1,-1,0)^\mathrm{T}$ and $(0,1,-1)^\mathrm{T}$ for basis. The unstable manifold of the unstable equilibrium has dimension 1: it is the line of direction $(1,1,1)^\mathrm{T}$.

Finally, the space ${\mathbb {R}}^3$ is partitioned in two domains by ${\mathscr {N}}_\rho $: depending on which domain ${\widehat{D}}_0$ belongs to in ${\mathbb {R}}^3\setminus {\mathscr {N}}_\rho $, ${\widehat{D}}(t)$ will converge towards either the uniform equilibrium or to $\alpha _+(1,1,1)^\mathrm{T}$ (or towards $\alpha _- (1,1,1)^\mathrm{T}$ if ${\widehat{D}}_0$ belongs to its stable manifold).

Case 3. When $\rho >\rho _c$

When $\rho >\rho _c$, ${\widehat{D}}(t)$ can converge towards one of the four equilibria: 0, $\alpha _3(-1,-1,1)^\mathrm{T}$, $\alpha _2(1,0,0)^\mathrm{T}$ or $\alpha _1 (1,1,1)^\mathrm{T}$ (see Fig. 2).

The uniform equilibrium is unstable and $-{{\,\mathrm{Hess}\,}}({\widehat{V}})(0)$ has signature $(+++)$; therefore, there is no stable direction and D(t) cannot converge towards 0 unless $D_0=0$ (and then $D(t)=0$ for all $t\in {\mathbb {R}}_+$).
The equilibrium $\alpha _3{{\,\mathrm{diag}\,}}(-1,-1,1)$ is unstable and its stable manifold ${\mathcal {M}}_1$ has dimension 1: it is the half line ${\mathbb {R}}_+^*\left( \begin{array}{c}1\\ 1\\ -1 \end{array}\right) $ (it is indeed an invariant manifold of dimension 1 and a solution on this half-line cannot converge towards an other equilibrium). Therefore, D(t) converges towards $\alpha _3{{\,\mathrm{diag}\,}}(-1,-1,1)$ if and only if ${\widehat{D}}_0$ lies on this half line.
The equilibrium $\alpha _2(1,0,0)^\mathrm{T}$ is unstable and its stable manifold has dimension 2 with a tangent plane generated by the two vectors $(1,0,0)^\mathrm{T}$ and $(0,1,-1)^\mathrm{T}$: it is the plane $\{d_2+d_3=0\}$. We note that this plane is an invariant manifold, so if ${\widehat{D}}_0$ belongs to this plane, the limit when $t\rightarrow +\infty $ also belongs to this plane. But since the half-lines ${\mathbb {R}}_+^*\left( \begin{array}{c} 1\\ 1\\ -1 \end{array}\right) $ and ${\mathbb {R}}_+^*\left( \begin{array}{c} 1\\ -1\\ 1 \end{array}\right) $ are the stable manifolds of the equilibria $\alpha _3(-1,-1,1)^\mathrm{T}$ and $\alpha _3(-1,1,-1)^\mathrm{T}$, the limit when $t\rightarrow +\infty $ is $\alpha _2(1,0,0)^\mathrm{T}$ if and only if $D_0$ belongs to the (open) quarter of the plane $\{d_2+d_3=0\}$ delimited by these two half-lines: this is the stable manifold of $\alpha _2{{\,\mathrm{diag}\,}}(1,0,0)$. In conclusion, D(t) converges towards $\alpha _2{{\,\mathrm{diag}\,}}(1,0,0)$ if and only if ${\widehat{D}}_0$ is of the form $(d_1,d_2,-d_2)^\mathrm{T}$ with $d_1>d_2\ge 0$. Since ${\widehat{D}}_0\in {{\,\mathrm{diag}\,}}^{-1}({\mathscr {D}}_3({\mathbb {R}}))$, we can restrict ourselves to the eighth of plan ${\mathcal {M}}_2$ delimited by the half lines ${\mathbb {R}}_+^*(1,1,-1)^\mathrm{T}$ excluded and ${\mathbb {R}}_+^*(1,0,0)^\mathrm{T}$ included.
In every other cases, D(t) converges towards $\alpha _1 I_3$ which is the only stable equilibrium.

Finally, in the case $\rho >\rho _c$, the subset ${\mathcal {N}}_\rho \cap {\mathscr {D}}_3({\mathbb {R}})$ is equal to ${{\,\mathrm{diag}\,}}\big (\{0\}\cup {\mathcal {M}}_1\cup {\mathcal {M}}_2\big )$.

The critical cases

We end this section by an informal description of the expected behaviour in the critical cases.

When $\rho =\rho ^*$, the uniform equilibrium is stable and there is an other equilibrium of the form $\alpha ^* I_3$ with $\alpha ^*>0$. This equilibrium is non-hyperbolic: the kernel of $-{{\,\mathrm{Hess}\,}}{\widehat{V}}(\alpha ^* (1,1,1)^\mathrm{T})$ is one dimensional, spanned by the vector $(1,1,1)^\mathrm{T}$. The two other eigenvalues are negative. Therefore, for the system (28b), there exists a centre manifold of dimension 1 and a stable manifold of dimension 2, which tangent plane being the orthogonal of $(1,1,1)^\mathrm{T}$.

If ${\widehat{D}}_0$ belongs to the stable manifold, D(t) converges exponentially fast towards $\alpha ^* I_3$ (Haragus and Iooss 2010, Theorem 3.22). The stable manifold of dimension 2 delimits two domains, and one of them is included in the subset $\{x\in {\mathbb {R}}^3, x\cdot (1,1,1)^\mathrm{T}\ge \alpha ^*\}$. If ${\widehat{D}}_0$ belongs to this domain, then it belongs to a centre manifold and ${\widehat{D}}(t)$ is attracted exponentially fast towards the line ${\mathbb {R}}(1,1,1)^\mathrm{T}$ and converges at rate 1/t towards $\alpha ^* (1,1,1)$. In every other cases, ${\widehat{D}}(t)$ converges exponentially fast towards 0.
When $\rho =\rho _c$ there is one stable equilibrium of the form $\alpha I_3$ with $\alpha >0$. The uniform equilibrium is non-hyperbolic and $-{{\,\mathrm{Hess}\,}}{\widehat{V}}(0)=0$.

In the case when ${\widehat{D}}(t)$ converges towards 0, with a formal computation, the rate of convergence is expected to be either 1/t or $1/\sqrt{t}$ depending on ${\widehat{D}}_0$. The convergence towards the stable anisotropic equilibrium is exponentially fast.

6 Macroscopic Limit for the Stable Equilibria

We now go back to the spatially inhomogeneous model. We want to investigate (at least formally) the hydrodynamic models derived from the BGK equation (3). To do so, we introduce the scaling $t'=\varepsilon t$ and $x'=\varepsilon x$ for $\varepsilon >0$ and we define $f^\varepsilon (t',x',A):=f(t, x, A)$. After this change of variables in (3) and dropping the primes, we see that $f^\varepsilon $ satisfies the following equation:

$$\begin{aligned} \partial _t f^\varepsilon +(Ae_1\cdot \nabla _x)f^\varepsilon = \frac{1}{\varepsilon }\left( \rho _{f^\varepsilon }M_{J_{f^\varepsilon }}-f^\varepsilon \right) . \end{aligned}$$

(37)

We want to investigate the macroscopic limit $\varepsilon \rightarrow 0$ with the assumption that $f^\varepsilon $ converges towards a stable equilibrium. Thanks to the results of the last section, we will assume that $f^\varepsilon \rightarrow \rho M_{\alpha \Lambda }$ where $\rho =\rho (t,x)$, $\alpha =\rho c_1(\alpha )$, $\alpha \in {\mathbb {R}}_+$ and $\Lambda =\Lambda (t,x)\in {\mathscr {M}}_3({\mathbb {R}})$ (with a notion of convergence as strong as needed). Since the equilibrium is assumed to be stable there are two cases: either $\Lambda \in SO_3({\mathbb {R}})$ (and therefore $\rho >\rho ^*$) or $\Lambda =0$ that is to say $f^\varepsilon $ is uniform in the body-orientation variable and converges towards $\rho =\rho (t,x)$ (and therefore $\rho <\rho _c$). For a given time $t\in {\mathbb {R}}_+$, we will say that $x\in {\mathbb {R}}^3$ belongs to a disordered region when $\Lambda (t,x)=0$. Otherwise, when $\Lambda (t,x)\in SO_3({\mathbb {R}})$, we will say that $x\in {\mathbb {R}}^3$ belongs to an ordered region.

The purpose of the two next sections is to write at least formally the hydrodynamic equations satisfied by $\rho =\rho (t,x)$ and $\Lambda =\Lambda (t,x)$. First notice that integrating (37) over $SO_3({\mathbb {R}})$ leads to the conservation law:

$$\begin{aligned} \partial _t \rho ^\varepsilon +\nabla _x\cdot j[f^\varepsilon ]=0, \end{aligned}$$

(38)

where $\rho ^\varepsilon \equiv \rho _{f^\varepsilon }$ and

$$\begin{aligned} j[f^\varepsilon ]:=\int _{SO_3({\mathbb {R}})} Ae_1 f^\varepsilon \,\mathrm{d}A\equiv J_{f^\varepsilon } e_1. \end{aligned}$$

The macroscopic model then depends on the region considered.

1.
In a disordered region, $j[f^\varepsilon ]\rightarrow 0$ and assuming that the convergence is sufficiently strong, we get that $\partial _t\rho =0$. To obtain more information we will look at the next order in the Chapman–Enskog expansion (Sect. 6.1).
2.
In an ordered region, $j[f^\varepsilon ]\rightarrow \alpha \Lambda e_1$ where $\alpha =\rho c_1(\alpha )$ and therefore, assuming that the convergence is strong enough:
$$\begin{aligned} \partial _t \rho +\nabla _x\cdot (\alpha \Lambda e_1)=0. \end{aligned}$$
However, due to the lack of conserved quantities, we will need specific tools to write an equation satisfied by $\Lambda =\Lambda (t,x)$ in order to obtain a closed system of equations on $(\rho ,\Lambda )$. This is the purpose of Sect. 6.2.

6.1 Diffusion Model in a Disordered Region

We consider a region where $f^\varepsilon $ converges as $\varepsilon \rightarrow 0$ to a density $\rho (t,x)$ uniform in the body-attitude variable. The following proposition gives the diffusion model obtained by looking at the next order in the Chapman–Enskog expansion.

Proposition 6.1

(Formal) In a disordered region, the density $\rho ^\varepsilon $ satisfies formally at first order the following diffusion equation:

$$\begin{aligned} \partial _t\rho ^\varepsilon =\varepsilon \nabla _x\cdot \left( \frac{\frac{1}{3}\nabla _x\rho ^\varepsilon }{1-\frac{\rho ^\varepsilon }{\rho _c}}\right) ,\,\,\,\,\,\,\rho _c=6. \end{aligned}$$

(39)

Proof

We follow the same calculations as in Degond et al. (2015): we write $f^\varepsilon =\rho ^\varepsilon +\varepsilon f_1^\varepsilon $ (where $f^\varepsilon _1$ is defined by this relation) and notice that:

$$\begin{aligned} J_{f^\varepsilon }=\varepsilon J_{f_1^\varepsilon }\,\,\,\,\,\text {and}\,\,\,\,\,\,M_{J_{f^\varepsilon }}(A)=1+\varepsilon J_{f_1^{\varepsilon }}\cdot A+{\mathcal {O}}(\varepsilon ^2). \end{aligned}$$

Inserting this in (37), multiplying by A and integrating over $SO_3({\mathbb {R}})$ leads to:

$$\begin{aligned} J_{f_1^\varepsilon }=\rho ^\varepsilon \int _{SO_3({\mathbb {R}})} (J_{f_1^\varepsilon }\cdot A)A\,\mathrm{d}A-\int _{SO_3({\mathbb {R}})} Ae_1\cdot \nabla _x\rho ^\varepsilon A\mathrm{d}A+{\mathcal {O}}(\varepsilon ). \end{aligned}$$

(40)

Using Lemma 3.3, it holds that

$$\begin{aligned} \int _{SO_3({\mathbb {R}})} (J_{f_1^\varepsilon }\cdot A)A\,\mathrm{d}A=\frac{1}{6}J_{f_1^\varepsilon }. \end{aligned}$$

(41)

To compute the second term, we note that $Ae_1\cdot \nabla _x\rho ^\varepsilon =2A\cdot R_{\rho ^\varepsilon }$ where $R_{\rho ^\varepsilon }$ is the matrix, the first column of which is equal to $\nabla _x\rho ^\varepsilon $ and the others are equal to zero. Using Lemma 3.3 we obtain

$$\begin{aligned} \int _{SO_3({\mathbb {R}})} Ae_1\cdot \nabla _x\rho ^\varepsilon A\mathrm{d}A=\frac{1}{3}R_{\rho ^\varepsilon }. \end{aligned}$$

(42)

By multiplying (40) by $e_1$, it follows from (41) and (42) that:

$$\begin{aligned} \left( 1-\frac{\rho ^\varepsilon }{\rho _c}\right) J_{f_1^{\varepsilon }}e_1=-\frac{1}{3}\nabla _x \rho ^\varepsilon +{\mathcal {O}}(\varepsilon ), \end{aligned}$$

which gives the result by inserting this in (38). $\square $

Remark 6.1

This analysis does not depend on the dimension. In $SO_n({\mathbb {R}})$ the same formal result holds:

$$\begin{aligned} \partial _t\rho ^\varepsilon =\varepsilon \nabla _x\cdot \left( \frac{\frac{1}{n}\nabla _x\rho ^\varepsilon }{1-\frac{\rho ^\varepsilon }{\rho _c}}\right) ,\,\,\,\,\,\,\rho _c=2n. \end{aligned}$$

6.2 Self-organised Hydrodynamics in an Ordered Region

In the following, for a given density $\rho \in {\mathbb {R}}_+$, $\alpha (\rho )$ denotes the maximal non-negative root of $\alpha =\rho c_1(\alpha )$. We are going to prove the following theorem.

Theorem 9

(Formal) We suppose that $f^\varepsilon \rightarrow \rho (x,t) M _{J(x,t)}$ (as strongly as necessary) as $\varepsilon \rightarrow 0$ where $J(x,t)=\alpha (\rho (x,t))\Lambda (x,t)$ and $\Lambda (x,t)\in SO_3({\mathbb {R}})$. Then, $\rho $ and $\Lambda $ satisfy the following system of partial differential equations:

$$\begin{aligned}&\partial _t \rho +\nabla _x\cdot (\rho c_1(\alpha (\rho ))\Lambda e_1)=0 , \end{aligned}$$

(43a)

$$\begin{aligned}&\rho (\partial _t\Lambda +{\tilde{c}}_2((\Lambda e_1)\cdot \nabla _x)\Lambda )+\tilde{c_3}[(\Lambda e_1)\times \nabla _x\rho ]_\times \Lambda \nonumber \\&\quad +c_4\rho [-{\mathbf {r}}_x(\Lambda )\times (\Lambda e_1)+\delta _x(\Lambda )\Lambda e_1]_\times \Lambda =0. \end{aligned}$$

(43b)

where ${\tilde{c}}_2$, $\tilde{c_3}$, $c_4$ are functions of $\rho $ to be defined later and $\delta $ and ${\mathbf {r}}$ are the “divergence” and “rotational” operators defined in Degond et al. (2017): if $\Lambda (x)=\exp ([{\mathbf {b}}(x)]_\times )\Lambda (x_0)$ with ${\mathbf {b}}$ smooth around $x_0$ and ${\mathbf {b}}(x_0)=0$, then

$$\begin{aligned} \delta _x(\Lambda )(x_0):=\nabla _x\cdot {\mathbf {b}}(x)|_{x=x_0}\,\,\,\text {and}\,\,\,{\mathbf {r}}_x(\Lambda )(x_0):=\nabla _x\times {\mathbf {b}}(x)|_{x=x_0}, \end{aligned}$$

where $\nabla _x\times $ is the curl operator.

The first equation (43a) is the conservation law (38) and the goal is to obtain the equation (43b) for $\Lambda =\Lambda (t,x)$. However, here and contrary to the classical gas dynamics, the total momentum is not conserved:

$$\begin{aligned} \frac{\mathrm{d}}{\mathrm{d}t}\int _{SO_3({\mathbb {R}})} f A\,\mathrm{d}A\ne 0 \end{aligned}$$

and we therefore cannot deduce easily a closed system of equations. This lack of conserved quantities is specific to self-propelled particle models such as the Vicsek model. The main tool to tackle the problem will be the Generalised Collision Invariants method introduced in Degond and Motsch (2008) for the study of the hydrodynamic limit of the continuum Vicsek model. Its precise setting in the context of our body-attitude model is detailed Sect. 6.2.1. The formal proof of Theorem 9 can be found in Sect. 6.2.2.

6.2.1 Generalised Collision Invariants

To obtain an equation on $\Lambda $, the main tool are the Generalised Collisional Invariants (GCI) first introduced in Degond and Motsch (2008). For a given $J\in {\mathscr {M}}_3({\mathbb {R}})$, we define first the linear collision operator:

$$\begin{aligned} {\mathcal {L}}_J(f)=\rho _f M_J-f, \end{aligned}$$

so that $Q_\mathrm{BGK}(f)={\mathcal {L}}_{J_f}(f)$. Let $J\in {\mathscr {M}}_3({\mathbb {R}})$ with $\det J>0$ and let $\Lambda {:}=PD(J)\in SO_3({\mathbb {R}})$ be the orthogonal part of its polar decomposition. The set of GCI associated with J is defined as:

$$\begin{aligned} {\mathscr {C}}_{J}{:}=\left\{ \psi : SO_3({\mathbb {R}})\rightarrow {\mathbb {R}},\,\,\,\int _{SO_3({\mathbb {R}})} {\mathcal {L}}_J(f)\,\psi \,\mathrm{d}A=0\,\,\,\text {for all}\,\,f\,\,\text {such that}\,\,P_{T_\Lambda }(J_f)=0\right\} .\nonumber \\ \end{aligned}$$

(44)

The condition $\psi \in {\mathscr {C}}_J$ is equivalent to:

$$\begin{aligned} \int _{SO_3({\mathbb {R}})} f(\langle \psi \rangle _{M_J}-\psi )\,\mathrm{d}A=0\,\,\,\,\,\,\text {for all}\,\,\, f\,\,\,\text {such that}\,\,\, P_{T_\Lambda }(J_f)=0. \end{aligned}$$

Therefore, following the ideas of the proof of (Degond et al. 2017, Proposition 4.3), we have:

$$\begin{aligned} \psi \in {\mathscr {C}}_J\,\,\,\Longleftrightarrow \,\,\,\exists B\in T_{\Lambda },\,\,\,\langle \psi \rangle _{M_J}-\psi (A)=B\cdot A, \end{aligned}$$

that is to say:

$$\begin{aligned} \psi \in {\mathscr {C}}_{J}\,\,\,\Longleftrightarrow \,\,\,\exists B\in T_{\Lambda },\,\,\exists C\in {\mathbb {R}},\,\,\,\psi (A)=-B\cdot A + C, \end{aligned}$$

or equivalently since $B\in T_\Lambda $ means that there exists $P\in {\mathscr {A}}_3({\mathbb {R}})$ such that $B=\Lambda P$:

$$\begin{aligned} {\mathscr {C}}_J = {{\,\mathrm{Span}\,}}\left( 1,\underset{P\in {\mathscr {A}}_3({\mathbb {R}})}{\bigcup } \psi ^\Lambda _P\right) , \end{aligned}$$

where

$$\begin{aligned} \psi _P^\Lambda (A)=-P\cdot (\Lambda ^\mathrm{T} A). \end{aligned}$$

Now for any $P\in {\mathscr {A}}_3({\mathbb {R}})$, denoting $\Lambda _{f^\varepsilon }\equiv \Lambda ^\varepsilon =PD(J_{f^\varepsilon })$, we get by multiplying the Eq. (37) by $\psi _P^{\Lambda ^\varepsilon }$:

$$\begin{aligned} \int _{SO_3({\mathbb {R}})} (\partial _t f^\varepsilon +Ae_1\cdot \nabla _x f^\varepsilon )P\cdot ((\Lambda ^\varepsilon )^\mathrm{T}A)\,\mathrm{d}A=0. \end{aligned}$$

(45)

The right-hand side vanishes by Definition (44) of the GCI. Note that if $f^\varepsilon \rightarrow \rho M_J$ with $\det (J)>0$, we have also $\det (J_{f^\varepsilon })>0$ for $\varepsilon $ sufficiently small and $\Lambda ^\varepsilon \in SO_3({\mathbb {R}})$.

6.2.2 Hydrodynamic Limit (Formal Proof of Theorem 9)

Taking formally the limit $\varepsilon \rightarrow 0$ in (45) and since it is true for all $P\in {\mathscr {A}}_3({\mathbb {R}})$, we obtain:

$$\begin{aligned} X{:}=\int _{SO_3({\mathbb {R}})} \Big (\partial _t(\rho M_{\alpha \Lambda })+Ae_1\cdot \nabla _x(\rho M_{\alpha \Lambda })\Big )(\Lambda ^\mathrm{T} A-A^\mathrm{T}\Lambda )\,\mathrm{d}A=0. \end{aligned}$$

We have:

$$\begin{aligned} \partial _t (M_{\alpha \Lambda })= & {} \Big (\partial _t(\alpha \Lambda )\cdot A - \langle \partial _t(\alpha \Lambda )\cdot A\rangle _{M_{\alpha \Lambda }}\Big ) M_{\alpha \Lambda } \\= & {} \alpha '\partial _t\rho (A\cdot \Lambda -\langle A\cdot \Lambda \rangle _{M_{\alpha \Lambda }})M_{\alpha \Lambda } + \alpha \partial _t \Lambda \cdot (A-\langle A\rangle _{M_{\alpha \Lambda }})M_{\alpha \Lambda }, \end{aligned}$$

where $\alpha '$ denotes the derivative of $\alpha (\rho )$ with respect to $\rho $ and similarly for $\partial _i(M_{\alpha \Lambda })$. With this we compute the term:

$$\begin{aligned}&(\partial _t+Ae_1\cdot \nabla _x)(\rho M_{\alpha \Lambda }) \\&= M_{\alpha \Lambda }(A)\left( 1+\rho \alpha '\left( A\cdot \Lambda -\frac{3}{2}c_1(\alpha )\right) \right) (\partial _t+Ae_1\cdot \nabla _x)\rho \\&\qquad + M_{\alpha \Lambda }(A)\rho \alpha A\cdot (\partial _t+Ae_1\cdot \nabla _x)\Lambda , \end{aligned}$$

where we have used that:

$$\begin{aligned} \langle A\cdot \Lambda \rangle _{M_{\alpha \Lambda }} = \frac{3}{2}c_1(\alpha ), \end{aligned}$$

and

$$\begin{aligned} \Lambda \cdot (\partial _t+Ae_1\cdot \nabla _x)\Lambda =0. \end{aligned}$$

Most of the terms that appear in X are computed in Degond et al. (2017). Precisely,

$$\begin{aligned} X=X_1+X_2+X_3+X_4+Y \end{aligned}$$

(46)

where $X_1$, $X_2$, $X_3$ and $X_4$ are computed in Degond et al. (2017):

$$\begin{aligned}&X_1:=\int _{SO_3({\mathbb {R}})}\partial _t\rho M_{\alpha \Lambda }(A)(\Lambda ^\mathrm{T}A-A^\mathrm{T}\Lambda )\,\mathrm{d}A=0,\\&X_2:=\int _{SO_3({\mathbb {R}})} \alpha \rho (A\cdot \partial _t\Lambda ) M_{\alpha \Lambda }(A)(\Lambda ^\mathrm{T} A-A^\mathrm{T}\Lambda )\,\mathrm{d}A=C_2\rho \alpha \Lambda ^\mathrm{T}\partial _t\Lambda ,\\&X_3:=\int _{SO_3({\mathbb {R}})} Ae_1\cdot \nabla _x\rho M_{\alpha \Lambda }(A)(\Lambda ^\mathrm{T}A-A^\mathrm{T}\Lambda )\,\mathrm{d}A=C_3[e_1\times \Lambda ^\mathrm{T}\nabla _x\rho ]_\times ,\\&X_4:=\int _{SO_3({\mathbb {R}})} \rho \alpha \big (A\cdot (Ae_1\cdot \nabla _x)\Lambda \big )M_{\alpha \Lambda }(A)(\Lambda ^\mathrm{T}A-A^\mathrm{T}\Lambda )\,\mathrm{d}A\\&\quad =\rho \alpha (C_4[Le_1]_\times +C_5[L^\mathrm{T} e_1+{{\,\mathrm{Tr}\,}}(L)e_1]_\times ), \end{aligned}$$

where the coefficients

$$\begin{aligned} C_2= & {} C_3:=\frac{2}{3}\{\sin ^2\theta \}_\alpha ,\,\,\,\,C_4:=\frac{2}{15}\{\sin ^2\theta (1+4\cos \theta )\}_\alpha ,\,\,\,\,\text {and}\\ C_5:= & {} \frac{2}{15}\{\sin ^2\theta (1-\cos \theta )\}_\alpha , \end{aligned}$$

and the matrix

$$\begin{aligned} L:=\Lambda ^\mathrm{T}{\mathcal {D}}_x(\Lambda )\Lambda , \end{aligned}$$

are the same as in Degond et al. (2017). The matrix ${\mathcal {D}}_x(\Lambda )\in {\mathscr {M}}_3({\mathbb {R}})$ is defined as the unique matrix such that for all ${\mathbf {w}}\in {\mathbb {R}}^3$, and smooth functions $\Lambda :{\mathbb {R}}^3\rightarrow SO_3({\mathbb {R}})$,

$$\begin{aligned} ({\mathbf {w}}\cdot \nabla _x)\Lambda =[{\mathcal {D}}_x(\Lambda ){\mathbf {w}}]_\times \Lambda \end{aligned}$$

(see Degond et al. 2017, Section 4.5). Note that $C_3=C_2$ since the noise and alignment parameters which were denoted by $\nu $ and d in Degond et al. (2017) have been taken equal to 1 here. Note also that these coefficients are functions of $\rho $ (through $\alpha $ only). The term Y is an additional term which appears here due to the presence of the parameter $\alpha =\alpha (\rho )$ which is a function of $\rho $. It depends also on the derivative $\alpha '$ of $\alpha $:

$$\begin{aligned} Y{:}=\rho \alpha '\int _{SO_3({\mathbb {R}})} (\Lambda ^\mathrm{T}A-A^\mathrm{T}\Lambda )M_{\alpha \Lambda }(A)\left( A\cdot \Lambda -\frac{3}{2}c_1(\alpha )\right) (\partial _t+Ae_1\cdot \nabla _x)\rho \,\mathrm{d}A. \end{aligned}$$

All the terms that involve the time derivative of $\rho $ are equal to zero since $\partial _t\rho $ does not depend on A and with the change of variable $A' =\Lambda A^\mathrm{T}\Lambda $ which has unit Jacobian, it holds that $A'\cdot \Lambda =A\cdot \Lambda $ and therefore

$$\begin{aligned}&\int _{SO_3({\mathbb {R}})} (\Lambda ^\mathrm{T}A-A^\mathrm{T}\Lambda )M_{\alpha \Lambda }(A)\left( A\cdot \Lambda -\frac{3}{2}c_1(\alpha )\right) \,\mathrm{d}A\\&\quad =-\int _{SO_3({\mathbb {R}})} (\Lambda ^\mathrm{T}A'-A'^\mathrm{T}\Lambda )M_{\alpha \Lambda }(A')\left( A'\cdot \Lambda -\frac{3}{2}c_1(\alpha )\right) \,\mathrm{d}A'=0. \end{aligned}$$

We thus have:

$$\begin{aligned} Y=Y_1+Y_2, \end{aligned}$$

where

$$\begin{aligned} Y_1{:} = \rho \alpha '\int _{SO_3({\mathbb {R}})} (A\cdot \Lambda )(Ae_1\cdot \nabla _x)\rho \,(\Lambda ^\mathrm{T}A-A^\mathrm{T}\Lambda )M_{\alpha \Lambda }(A)\,\mathrm{d}A, \end{aligned}$$

and

$$\begin{aligned} Y_2{:}= \frac{3}{2}c_1(\alpha )\rho \alpha '\int _{SO_3({\mathbb {R}})} (Ae_1\cdot \nabla _x)\rho \, (\Lambda ^\mathrm{T} A-A^\mathrm{T}\Lambda )M_{\alpha \Lambda }(A)\,\mathrm{d}A. \end{aligned}$$

With the change of variable $A\mapsto \Lambda ^\mathrm{T} A$, these terms become

$$\begin{aligned} Y_1= & {} \rho \alpha '\int _{SO_3({\mathbb {R}})} (\Lambda Be_1\cdot \nabla _x\rho )(B-B^\mathrm{T})(B\cdot I_3) M_{\alpha I_3}(B)\,dB,\\ Y_2{:}= & {} \frac{3}{2}c_1(\alpha )\rho \alpha ' \int _{SO_3({\mathbb {R}})} (\Lambda Be_1\cdot \nabla _x\rho )(B-B^\mathrm{T})M_{\alpha I_3}(B)\,dB, \end{aligned}$$

and they can be computed using the same techniques as in Degond et al. (2017) or Lemma 3.4 in the appendix. More precisely, we can write

$$\begin{aligned} \Lambda Be_1\cdot \nabla _x\rho = B\cdot R_{1,\rho }, \end{aligned}$$

where $R_{1,\rho }$ is the matrix, the first column of which is equal to $2\Lambda ^\mathrm{T} \nabla _x\rho $ and the others are all equal to zero. It satisfies:

$$\begin{aligned} \frac{R_{1,\rho }-R_{1,\rho }^\mathrm{T}}{2}= [e_1\times \Lambda ^\mathrm{T}\nabla _x\rho ]_\times . \end{aligned}$$

By the change of variable $B\mapsto B^\mathrm{T}$, we have

$$\begin{aligned} Y_1= & {} \rho \alpha '\int _{SO_3({\mathbb {R}})} (B\cdot R_{1,\rho })(B-B^\mathrm{T})(B\cdot I_3) M_{\alpha I_3}(B)\,dB,\\= & {} \rho \alpha '\int _{SO_3({\mathbb {R}})} A\cdot (R_{1,\rho }-R_{1,\rho }^\mathrm{T})A(A\cdot I_3)M_{\alpha I_3}(A)\,\mathrm{d}A. \end{aligned}$$

This integral is of the form (47) where

$$\begin{aligned} g(A):=A\cdot I_3M_{\alpha I_3}(A) \end{aligned}$$

is invariant by transposition and conjugation and

$$\begin{aligned} J:=R_{1,\rho }-R_{1,\rho }^\mathrm{T} \end{aligned}$$

is a skew-symmetric matrix. From (49) and (50) we get

$$\begin{aligned} Y_1=\rho \alpha \mu (R_{1,\rho }-R_{1,\rho }^\mathrm{T}) \end{aligned}$$

with

$$\begin{aligned} \mu :=\frac{1}{8}\int _{SO_3({\mathbb {R}})}(a_{21}-a_{12})^2{{\,\mathrm{Tr}\,}}(A)M_{\alpha I_3}(A)\,\mathrm{d}A. \end{aligned}$$

The first term $Y_1$ can therefore be written:

$$\begin{aligned} Y_1=2\mu \rho \alpha '[e_1\times \Lambda ^\mathrm{T}\nabla _x\rho ]_\times , \end{aligned}$$

and using Rodrigues’ formula we obtain:

$$\begin{aligned} 2\mu =\frac{1}{3}\Big \{ \left( 1+2\cos \theta \right) \sin ^2(\theta )\Big \}_{\alpha } \end{aligned}$$

where $\{\cdot \}_\alpha $ has been defined in Proposition 4.1. Similarly, the second term $Y_2$ can be written:

$$\begin{aligned} Y_2=\frac{3}{2}c_1(\alpha ) C_3\rho \alpha '[e_1\times \Lambda ^\mathrm{T}\nabla _x\rho ]_\times , \end{aligned}$$

where the coefficient $C_3$ is the same as in Degond et al. (2017):

$$\begin{aligned} C_3=\frac{2}{3}\{\sin ^2\theta \}_\alpha =C_2. \end{aligned}$$

Finally, we obtain:

$$\begin{aligned} Y=\rho \alpha '\left( \frac{3}{2}c_1(\alpha )C_3+\frac{1}{3}\Big \{ \left( 1+2\cos \theta \right) \sin ^2(\theta )\Big \}_{\alpha }\right) [e_1\times \Lambda ^\mathrm{T}\nabla _x\rho ]_\times . \end{aligned}$$

Putting all the terms together, we can conclude as in Degond et al. (2017). First we notice that:

$$\begin{aligned} {{\,\mathrm{Tr}\,}}(L)= & {} \delta _x(\Lambda ),\,\,\,\,[\Lambda L^\mathrm{T} e_1]_\times =[({\mathcal {D}}_x(\Lambda )-[{\mathbf {r}}_x(\Lambda )]_\times )\Lambda e_1]_\times \,\,\,\,\text {and}\,\,\,\,[\Lambda Le_1]_\times \Lambda \\= & {} \big ((\Lambda e_1)\cdot \nabla _x\big )\Lambda . \end{aligned}$$

Therefore, we obtain by multiplying (46) by $\Lambda $ and dividing by $\alpha C_2$:

$$\begin{aligned}&\rho (\partial _t\Lambda +c_2((\Lambda e_1)\cdot \nabla _x)\Lambda )+\tilde{c_3}[(\Lambda e_1)\times \nabla _x\rho ]_\times \Lambda \\&\quad +c_4\rho [-{\mathbf {r}}_x(\Lambda )\times (\Lambda e_1)+\delta _x(\Lambda )\Lambda e_1]_\times \Lambda =0, \end{aligned}$$

where the coefficients

$$\begin{aligned} {\tilde{c}}_2:=\frac{C_4+C_5}{C_2}=\frac{1}{5}\frac{\{\sin ^2\theta (2+3\cos \theta )\}_\alpha }{\{\sin ^2\theta \}_\alpha }\,\,\,\,\text {and}\,\,\,\,c_4:=\frac{C_5}{C_2}=\frac{1}{5}\frac{\{\sin ^2\theta (1-\cos \theta )\}_\alpha }{\{\sin ^2\theta \}_\alpha } \end{aligned}$$

are, respectively, equal to the coefficients $c_2$ and $c_4$ in Degond et al. (2017) and the coefficient $c_3$ in Degond et al. (2017) (which is equal to 1) becomes:

$$\begin{aligned} \tilde{c_3}=\frac{1}{\alpha }+\frac{\rho \alpha '}{\alpha }\left( \frac{3}{2}c_1(\alpha )+\frac{1}{2}\frac{\Big \{(1+2\cos \theta )\sin ^2\theta \Big \}_\alpha }{\{ \sin ^2\theta \}_\alpha }\right) . \end{aligned}$$

7 Conclusion and Perspectives

In this work, we have presented a new BGK model of body-attitude coordination where agents are described by a rotation matrix. Starting from the kinetic level (a space homogeneous BGK equation), we have drawn a parallel between our Vicsek-type model and the models of nematic alignment of polymers. We then have deduced the equilibria of the system and have shown a phase transition phenomenon triggered by the density of agents. Thanks to a gradient-flow structure specific to the BGK equation, we have been able to describe the asymptotic behaviour of the system. Finally, we have derived the macroscopic models (SOHB) in the spatially inhomogeneous case.

This model and its analysis raise many open questions. At the kinetic level, research perspectives include the following:

1.
It would be interesting to quantify the time that takes a solution to reach the basin of attraction of a stable equilibria. This question has been addressed for other types of collective-dynamics models, for instance in Morales and Poyato (2019).
2.
Our study relies on the dimension 3 and it would be interesting to extend the ideas developed here in $SO_n({\mathbb {R}})$, $n\ge 3$, for example by drawing new parallels with higher dimensional polymers models or other similar models Giacomin et al. (2012b), Giacomin et al. (2012a).
3.
The space-inhomogeneous case, not considered here, is much more difficult to tackle due to the transport operator which prevents the reduction to a gradient-flow (in finite dimension for the BGK operator and in infinite dimension for the Fokker–Planck operator). This situation is not specific to the body-orientation model presented here, and the question is largely open even in the Vicsek case (see the conclusion of Degond et al. 2015). Numerical simulations of the individual-based model suggest that clusterisation phenomena appear: the space can be decomposed in large-density regions (the clusters) and low-density regions. Inside a cluster, flocking is observed which is consistent with the fact that it is the only stable equilibrium in large density regions. Between the clusters, a diffusive behaviour is observed, and again, it seems consistent with the analysis in the low-density (homogenous) case presented in this article. The dynamics of the clusters (which can be seen as a kind of free-boundary problem) remains an open problem.

At the macroscopic level, the mathematical and numerical analyses of the SOHB model formally derived here are still in progress. The factor $\rho $ in front of the transport term of Eq. (43b) suggests that a singularity when $\rho \rightarrow 0$ could prevent well-posedness in low-density regions. A detailed investigation of the domain of validity of the SOHB model (and the diffusive model) at the macroscopic scale would be desirable. In particular, it is not known whether there are cases where the dynamics creates “empty spaces” in finite time. The results at the kinetic level on the stability of the uniform and flocking equilibria could give a first indication. However, it is not clear how these quantitative results at the kinetic level can be transposed at the macroscopic level through the rescaling procedure.

The BGK model studied here is a step towards the full description of the models of collective behaviour depicted in Fig. 1. The tools and ideas that we have presented here may help to analyse other models of body-attitude coordination such as the non-normalised Fokker–Planck model in $SO_3({\mathbb {R}})$. Other models which take into account curvature control in addition to body-orientation, in the spirit of Degond and Motsch (2011), could also be considered.

References

Albi, G., Bellomo, N., Fermo, L., Kim, J., Pareschi, L., Poyato, D., Soler, J., et al.: Vehicular traffic, crowds, and swarms: from kinetic theory and multiscale methods to applications and research perspectives. Math. Models Methods Appl. Sci. 29(10), 1901–2005 (2019)
MathSciNet MATH Google Scholar
Ball, J.M.: Mathematics and liquid crystals. Mol. Cryst. Liq. Cryst. 647(1), 1–27 (2017)
MATH Google Scholar
Ball, J.M., Majumdar, A.: Nematic liquid crystals: from Maier–Saupe to a continuum theory. Mol. Cryst. Liq. Cryst. 525(1), 1–11 (2010)
Google Scholar
Bhatnagar, P.L., Gross, E.P., Krook, M.: A model for collision processes in gases. I. Small amplitude processes in charged and neutral one-component systems. Phys. Rev. 94(3), 511 (1954)
MATH Google Scholar
Bolley, F., Canizo, J.A., Carrillo, J.A.: Stochastic mean-field limit: non-Lipschitz forces and swarming. Math. Models Methods Appl. Sci. 21(11), 2179–2210 (2011)
MathSciNet MATH Google Scholar
Bolley, F., Cañizo, J.A., Carrillo, J.A.: Mean-field limit for the stochastic Vicsek model. Appl. Math. Lett. 25(3), 339–343 (2012)
MathSciNet MATH Google Scholar
Caflisch, R.E.: The fluid dynamic limit of the nonlinear Boltzmann equation. Comm. Pure Appl. Math. 33(5), 651–666 (1980)
MathSciNet MATH Google Scholar
Carrillo, J.A., Choi, Y.-P., Perez, S.P.: A review on attractive-repulsive hydrodynamics for consensus in collective behavior. In: Bellomo, N., Degond, P., Tadmor, E. (eds.) Active Particles, vol. 1, pp. 259–298. Springer, Berlin (2017)
Google Scholar
Cercignani, C., Illner, R., Pulvirenti, M.: The Mathematical Theory of Dilute Gases, vol. 106. Springer, Berlin (2013)
MATH Google Scholar
Chuang, Y.-L., D’Orsogna, M.R., Marthaler, D., Bertozzi, A.L., Chayes, L.S.: State transitions and the continuum limit for a 2d interacting, self-propelled particle system. Phys. D 232(1), 33–47 (2007)
MathSciNet MATH Google Scholar
Cucker, F., Smale, S.: Emergent behavior in flocks. IEEE Trans. Autom. Control 52(5), 852–862 (2007)
MathSciNet MATH Google Scholar
Degond, P.: Macroscopic limits of the Boltzmann equation: a review. In: Degond, P., Pareschi, L., Russo, G. (eds.) Modeling and Computational Methods for Kinetic Equations, pp. 3–57. Springer, Berlin (2004)
Google Scholar
Degond, P., Motsch, S.: Continuum limit of self-driven particles with orientation interaction. Math. Models Methods Appl. Sci. 18(supp01), 1193–1215 (2008)
MathSciNet MATH Google Scholar
Degond, P., Motsch, S.: A macroscopic model for a system of swarming agents using curvature control. J. Stat. Phys. 143(4), 685–714 (2011)
MathSciNet MATH Google Scholar
Degond, P., Navoret, L.: A multi-layer model for self-propelled disks interacting through alignment and volume exclusion. Math. Models Methods Appl. Sci. 25(13), 2439–2475 (2015)
MathSciNet MATH Google Scholar
Degond, P., Frouvelle, A., Liu, J.-G.: Macroscopic limits and phase transition in a system of self-propelled particles. J. Nonlinear Sci. 23(3), 427–456 (2013)
MathSciNet MATH Google Scholar
Degond, P., Frouvelle, A., Liu, J.-G.: Phase transitions, hysteresis, and hyperbolicity for self-organized alignment dynamics. Arch. Ration. Mech. Anal. 216(1), 63–115 (2015)
MathSciNet MATH Google Scholar
Degond, P., Frouvelle, A., Merino-Aceituno, S.: A new flocking model through body attitude coordination. Math. Models Methods Appl. Sci. 27(06), 1005–1049 (2017)
MathSciNet MATH Google Scholar
Degond, P., Frouvelle, A., Merino-Aceituno, S., Trescases, A.: Alignment of self-propelled rigid bodies: from particle systems to macroscopic equations. arXiv preprint arXiv:1810.06903 (2018a)
Degond, P., Frouvelle, A., Merino-Aceituno, S., Trescases, A.: Quaternions in collective dynamics. Multiscale Model. Simul. 16(1), 28–77 (2018b)
MathSciNet MATH Google Scholar
Degond, P., Frouvelle, A., Merino-Aceituno, S., Trescases, A.: Hyperbolicity of SOHB Models (2020)
Diez, A.: Propagation of chaos and moderate interaction for a piecewise deterministic system of geometrically enriched particles. arXiv preprint arXiv:1908.00293 (2019)
Dimarco, G., Motsch, S.: Self-alignment driven by jump processes: macroscopic limit and numerical investigation. Math. Models Methods Appl. Sci. 26(07), 1385–1410 (2016)
MathSciNet MATH Google Scholar
Esposito, R., Guo, Y., Kim, C., Marra, R.: Stationary solutions to the Boltzmann equation in the hydrodynamic limit. Ann. PDE 4(1), 1 (2018)
MathSciNet MATH Google Scholar
Figalli, A., Kang, M.-J., Morales, J.: Global well-posedness of the spatially homogeneous Kolmogorov–Vicsek model as a gradient flow. Arch. Ration. Mech. Anal. 227(3), 869–896 (2018)
MathSciNet MATH Google Scholar
Gallagher, I., Saint-Raymond, L., Texier, B.: From Newton to Boltzmann: hard spheres and short-range potentials. Zürich lectures in advanced mathematics. European Mathematical Society (2013). ISBN: 9783037191293
Gamba, I.M., Kang, M.-J.: Global weak solutions for Kolmogorov–Vicsek type equations with orientational interactions. Arch. Ration. Mech. Anal. 222(1), 317–342 (2016)
MathSciNet MATH Google Scholar
Giacomin, G., Pakdaman, K., Pellegrin, X.: Global attractor and asymptotic dynamics in the Kuramoto model for coupled noisy phase oscillators. Nonlinearity 25(5), 1247 (2012a)
MathSciNet MATH Google Scholar
Giacomin, G., Pakdaman, K., Pellegrin, X., Poquet, C.: Transitions in active rotator systems: invariant hyperbolic manifold approach. SIAM J. Math. Anal. 44(6), 4165–4194 (2012b)
MathSciNet MATH Google Scholar
Golse, F., Saint-Raymond, L.: The Navier–Stokes limit of the Boltzmann equation for bounded collision kernels. Invent. Math. 155(1), 81–161 (2004)
MathSciNet MATH Google Scholar
Guo, Y., Jang, J.: Global Hilbert expansion for the Vlasov–Poisson–Boltzmann system. Commun. Math. Phys. 299(2), 469–501 (2010)
MathSciNet MATH Google Scholar
Ha, S.-Y., Liu, J.-G., et al.: A simple proof of the Cucker–Smale flocking dynamics and mean-field limit. Commun. Math. Sci. 7(2), 297–325 (2009)
MathSciNet MATH Google Scholar
Han, J., Luo, Y., Wang, W., Zhang, P., Zhang, Z.: From microscopic theory to macroscopic theory: a systematic study on modeling for liquid crystals. Arch. Ration. Mech. Anal. 215(3), 741–809 (2015)
MathSciNet MATH Google Scholar
Haragus, M., Iooss, G.: Local Bifurcations, Center Manifolds, and Normal Forms in Infinite-Dimensional Dynamical Systems. Springer, Berlin (2010)
MATH Google Scholar
Haraux, A.: Some applications of the Łojasiewicz gradient inequality. Commun. Pure Appl. Anal. 6, 2417–2427 (2012)
MATH Google Scholar
Hauray, M., Jabin, P.-E.: N-particles approximation of the Vlasov equations with singular potential. Arch. Ration. Mech. Anal. 183(3), 489–524 (2007)
MathSciNet MATH Google Scholar
Hemelrijk, C.K., Hildenbrandt, H.: Schools of fish and flocks of birds: their shape and internal structure by self-organization. Interface Focus 2(6), 726–737 (2012)
Google Scholar
Hemelrijk, C.K., Hildenbrandt, H., Reinders, J., Stamhuis, E.J.: Emergence of oblong school shape: models and empirical data of fish. Ethology 116(11), 1099–1112 (2010)
Google Scholar
Hildenbrandt, H., Carere, C., Hemelrijk, C.K.: Self-organized aerial displays of thousands of starlings: a model. Behav. Ecol. 21(6), 1349–1359 (2010)
Google Scholar
Hirsch, M.W., Smale, S., Devaney, R.L.: Differential Equations, Dynamical Systems, and an Introduction to Chaos. Academic Press, London (2012)
MATH Google Scholar
Horn, A.: Doubly stochastic matrices and the diagonal of a rotation matrix. Am. J. Math. 76(3), 620–630 (1954)
MathSciNet MATH Google Scholar
Jabin, P.-E.: A review of the mean field limits for Vlasov equations. Kinet. Relat. Models 7(4), 661–711 (2014)
MathSciNet MATH Google Scholar
Jiang, N., Xiong, L., Zhang, T.-F.: Hydrodynamic limits of the kinetic self-organized models. SIAM J. Math. Anal. 48(5), 3383–3411 (2016)
MathSciNet MATH Google Scholar
Jiang, N., Luo, Y.-L., Zhang, T.-F.: Coupled self-organized hydrodynamics and Navier–Stokes models: local well-posedness and the limit from the self-organized kinetic-fluid models. arXiv preprint arXiv:1712.10134 (2017)
Jourdain, B., Méléard, S.: Propagation of chaos and fluctuations for a moderate model with smooth initial data. Ann. Inst. Henri Poincaré Probab. Stat. 34(6), 727–766 (1998)
MathSciNet MATH Google Scholar
Kac, M.: Foundations of kinetic theory. In: Proceedings of the Third Berkeley Symposium on Mathematical Statistics and Probability, Vol. 3, pp. 171–197. University of California Press Berkeley and Los Angeles, CA (1956)
Lanford, O.E.: Time evolution of large classical systems. In: Moser, J. (ed.) Dynamical Systems, Theory and Applications, pp. 1–111. Springer, Berlin (1975)
Google Scholar
Lax, P.D.: Linear Algebra and Its Applications, 2nd edn. Wiley, New York (2007)
MATH Google Scholar
Łojasiewicz, S.: Sur les trajectoires du gradient d’une fonction analytique. Seminari di geometria, Univ. Stud. Bologna, Bologna 1982–1983, 115–117 (1984)
MathSciNet MATH Google Scholar
Marchetti, M.C., Joanny, J.F., Ramaswamy, S., Liverpool, T.B., Prost, J., Rao, M., Simha, R.A.: Hydrodynamics of soft active matter. Rev. Mod. Phys. 85(3), 1143–1189 (2013)
Google Scholar
Mischler, S., Mouhot, C.: Kac’s program in kinetic theory. Invent. Math. 193(1), 1–147 (2013)
MathSciNet MATH Google Scholar
Morales, J., Poyato, D.: On the trend to global equilibrium for Kuramoto Oscillators. arXiv preprint arXiv:1908.07657 (2019)
Motsch, S., Tadmor, E.: A new model for self-organized dynamics and its flocking behavior. J. Stat. Phys. 144(5), 923 (2011)
MathSciNet MATH Google Scholar
Oelschläger, K.: A law of large numbers for moderately interacting diffusion processes. Zeitschrift für Wahrscheinlichkeitstheorie und Verwandte Gebiete 69(2), 279–322 (1985)
MathSciNet MATH Google Scholar
Perko, L.: Differential Equations and Dynamical Systems, vol. 7. Springer, Berlin (2013)
MATH Google Scholar
Perthame, B.: Global existence to the BGK model of Boltzmann equation. J. Differ. Equ. 82, 191–205 (1989)
MathSciNet MATH Google Scholar
Quarteroni, A., Sacco, R., Saleri, F.: Numerical Mathematics, vol. 37. Springer, Berin (2010)
MATH Google Scholar
Saint-Raymond, L.: From the BGK model to the Navier–Stokes equations. Ann. Sci. Éc. Norm. Supér. 36(4), 271–317 (2003)
MathSciNet MATH Google Scholar
Salamin, E.: Application of quaternions to computation with rotations. Technical report, Working Paper (1979)
Sznitman, A.-S.: Topics in propagation of chaos. In: Hennequin, P.-L. (ed.) Éc. Été Probab. St.-Flour XIX—1989, pp. 165–251. Springer, Berlin (1991)
Vicsek, T., Czirók, A., Ben-Jacob, E., Cohen, I., Shochet, O.: Novel type of phase transition in a system of self-driven particles. Phys. Rev. Lett. 75(6), 1226 (1995)
MathSciNet Google Scholar
Wang, H., Hoffman, P., et al.: A unified view on the rotational symmetry of equilibiria of nematic polymers, dipolar nematic polymers, and polymers in higher dimensional space. Commun. Math. Sci. 6(4), 949–974 (2008)
MathSciNet MATH Google Scholar
Zhang, T.-F., Jiang, N.: A local existence of viscous self-organized hydrodynamic model. Nonlinear Anal. Real World Appl. 34, 495–506 (2017)
MathSciNet MATH Google Scholar
Zhou, H., Wang, H.: Stability of equilibria of nematic liquid crystalline polymers. Acta Math. Sci. 31(6), 2289–2304 (2011)
MathSciNet MATH Google Scholar

Download references

Acknowledgements

PD acknowledges support by the Engineering and Physical Sciences Research Council (EPSRC) under Grants Nos. EP/M006883/1 and EP/P013651/1, by the Royal Society and the Wolfson Foundation through a Royal Society Wolfson Research Merit Award No. WM130048 and by the National Science Foundation (NSF) under Grant No. RNMS11-07444 (KI-Net). PD is on leave from CNRS, Institut de Mathématiques de Toulouse, France. A. D. acknowledges the hospitality of CEREMADE, Université Paris-Dauphine and the University of Sussex where part of this research was carried out. A.D. was supported for this work by the École Normale Supérieure de Rennes through an Erasmus+ grant. This work was initiated at the École Normale Supérieure de Lyon as part of a Master degree delivered to A.D. A.F. acknowledges support from the EFI project ANR-17-CE40-0030 and the Kibord Project ANR-13-BS01-0004 of the French National Research Agency (ANR), from the project Défi S2C3 POSBIO of the interdisciplinary mission of CNRS, and the project SMS co-funded by CNRS and the Royal Society. SMA is supported by the Vienna Science and Technology Fund (WWTF) with a Vienna Research Groups for Young Investigators, grant VRG17-014. The authors wish to acknowledge the anonymous referees for useful comments and hints.

Author information

Authors and Affiliations

Department of Mathematics, Imperial College London, South Kensington Campus, London, SW7 2AZ, UK
P. Degond & A. Diez
CEREMADE, CNRS, Université Paris-Dauphine, Université PSL, 75016, Paris, France
A. Frouvelle
Faculty of Mathematics, University of Vienna, Oskar-Morgenstern-Platz 1, 1090, Wien, Austria
S. Merino-Aceituno
Department of Mathematics, University of Sussex, Falmer, BN1 9RH, UK
S. Merino-Aceituno

Authors

P. Degond
View author publications
You can also search for this author in PubMed Google Scholar
A. Diez
View author publications
You can also search for this author in PubMed Google Scholar
A. Frouvelle
View author publications
You can also search for this author in PubMed Google Scholar
S. Merino-Aceituno
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to P. Degond.

Ethics declarations

Data statement

No new data were collected in the course of this research.

Conflict of interest

The authors have no conflict of interest to declare.

Additional information

Communicated by Paul Newton.

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Appendices

Quaternions and Rotations

This appendix is devoted to the proof of Proposition 4.3. We also give additional results about quaternions. The following lemma gives a link between quaternions and the theory of Q-tensors.

Lemma A.1

Let ${\mathscr {S}}_4^0({\mathbb {R}})$ be the space of symmetric $4\times 4$ trace free matrices. If $Q\in {\mathscr {S}}_4^0({\mathbb {R}})$ has two eigenvalues with eigenspaces of dimensions 1 and 3, then Q can be written

$$\begin{aligned} Q=\alpha \left( q\otimes q-\frac{1}{4}I_4\right) , \end{aligned}$$

for a given unit quaternion q seen as a vector of ${\mathbb {R}}^4$. A matrix of this form is called a uniaxial Q-tensor. When $\alpha =1$, we will say that Q is a normalised uniaxial Q-tensor.

Proof

Let $Q\in {\mathscr {S}}_4^0({\mathbb {R}})$ such that Q has two eigenvalues with eigenspaces of dimensions 1 and 3. By the spectral theorem, there exists $P\in {\mathcal {O}}_3({\mathbb {R}})$ such that for a given $\alpha >0$:

$$\begin{aligned} Q=\frac{\alpha }{4}P{{\,\mathrm{diag}\,}}(3,-1,-1,-1)P^\mathrm{T}=\alpha P{{\,\mathrm{diag}\,}}(1,0,0,0)P^\mathrm{T}-\frac{\alpha }{4} I_4, \end{aligned}$$

and the result follows by taking q equal to the first column of P. $\square $

Proof of Proposition 4.3

1.
The group isomorphism $\Phi $ is explicitly computed in Salamin (1979). In particular, let $q\in {\mathbb {S}}^3/\pm 1$. The matrix $A=\Phi (q)$ is defined for all purely imaginary quaternion $u\in {\mathbb {H}}$ by $A[u]=[quq^*]$ where $(u_1,u_2,u_3)^\mathrm{T}=:[u]\in {\mathbb {R}}^3$ is the vector associated with $u=u_1i+u_2j+u_3k$. More explicitly, if $q=x+iy+zj+tk$, then
$$\begin{aligned} A=\left( \begin{array}{ccc} x^2+y^2-z^2-t^2 &{} 2(yz-xt) &{} 2(xz+yt) \\ 2(xt+yz) &{} x^2-y^2+z^2-t^2 &{} 2(zt-xy) \\ 2(yt-xz) &{} 2(xy+zt) &{} x^2-y^2-z^2+t^2 \end{array} \right) . \end{aligned}$$
Note that we have identified $q\in {\mathbb {H}}$ and its equivalence class in ${\mathbb {S}}^3/\pm 1$ and that $\Phi $ is well defined since only quadratic expressions are involved. The fact that this group isomorphism is an isometry follows from (Degond et al. 2018b, Proposition A.3) and (Degond et al. 2017, Lemma 4.2).
2.
The expression
$$\begin{aligned} J\cdot A = \frac{1}{2} {{\,\mathrm{Tr}\,}}(\Phi (q)^\mathrm{T} J) \end{aligned}$$
is a quadratic form for q. We take Q the matrix associated with this quadratic form. For $J=(J_{ij})_{i,j}$, using the explicit form of $A=\Phi (q)$ with $q=x+yi+zj+tk$ we obtain:
$$\begin{aligned} Q=\frac{1}{4}\left( \begin{array}{cccc} {J_{11}+J_{22}+J_{33}} &{} J_{32}-J_{23} &{} J_{13}-J_{31} &{} J_{21}-J_{12} \\ J_{32}-J_{23} &{} {J_{11}-J_{22}-J_{33}} &{} J_{12}+J_{21} &{} J_{13}+J_{31} \\ J_{13}-J_{31} &{} J_{12}+J_{21} &{} {-J_{11}+J_{22}-J_{33}} &{} J_{23}+J_{32} \\ J_{21}-J_{12} &{} J_{13}+J_{31} &{} J_{23}+J_{32} &{} {-J_{11}-J_{22}+J_{33}} \end{array} \right) . \end{aligned}$$
This is an isomorphism since $\dim {\mathscr {M}}_3({\mathbb {R}})=\dim {\mathscr {S}}_4^0({\mathbb {R}})$ and J can be obtained from Q similarly. Moreover, since the bilinear matrix associated with a quadratic form is uniquely defined, if $Q\in {\mathscr {S}}_4^0({\mathbb {R}})$ is such that $\frac{1}{2}J\cdot \Phi (q)=q\cdot Qq$ for all $q\in {\mathbb {S}}^3/\pm 1$, then $Q=\phi (J)$.
3.
To prove the third point, we note that a unit quaternion can be seen as a rotation in ${\mathbb {R}}^3$ in a more geometrical way (Degond et al. 2018b, Section 5.1): for $\theta \in [0,\pi ]$ and ${\mathbf {n}}=(n_1,n_2,n_3)^\mathrm{T}\in {\mathbb {S}}^2$, let us define the unit quaternion q by
$$\begin{aligned} q=\cos \frac{\theta }{2}+\sin \frac{\theta }{2}(n_1i+n_2j+n_3k), \end{aligned}$$
The unit quaternion q represents the rotation of angle $\theta \in [0,\pi ]$ and axis ${\mathbf {n}}\in {\mathbb {S}}^2$ in the sense that if $R(\theta ,{\mathbf {n}})\in SO_3({\mathbb {R}})$ denotes the matrix associated with the rotation of angle $\theta \in [0,\pi ]$ around the axis ${\mathbf {n}}\in {\mathbb {S}}^2$ then $\Phi (q)=R(\theta ,{\mathbf {n}})$. Note that q and $-q$ represent the same rotation so $\Phi (q)$ is well defined by identifying q with its equivalence class in ${\mathbb {S}}^3/\pm 1$.

In ${\mathbb {R}}^3$ the composition of two rotations of respective angles and axis $(\theta ,{\mathbf {n}})\in [0,\pi ]\times {\mathbb {S}}^2$ and $(\theta ',{\mathbf {n}}')\in [0,\pi ]\times {\mathbb {S}}^2$ is itself a rotation: we have $R(\theta ,{\mathbf {n}})R(\theta ',{\mathbf {n}}')=R({\hat{\theta }},\hat{{\mathbf {n}}})$ where the angle ${\hat{\theta }}\in [0,\pi ]$ is defined by
$$\begin{aligned} \cos \frac{{\hat{\theta }}}{2}=\cos \frac{\theta }{2}\cos \frac{\theta '}{2}-{\mathbf {n}}\cdot {\mathbf {n}}'\sin \frac{\theta }{2}\sin \frac{\theta '}{2}. \end{aligned}$$
Note that $\cos ({\hat{\theta }}/ {2})=q\cdot {\bar{q}}'$ where q and $q'$ are the associated unit quaternions seen as vectors of dimension 4. In particular the dot product of two rotations matrices is
$$\begin{aligned} R(\theta ,{\mathbf {n}})\cdot R(\theta ',{\mathbf {n}}')=\frac{1}{2}{{\,\mathrm{Tr}\,}}\Big (R(\theta ,{\mathbf {n}})R(\theta ',-{\mathbf {n}}')\Big )=\frac{1}{2}(2\cos {\tilde{\theta }}+1) \end{aligned}$$
where
$$\begin{aligned} \cos \frac{{\tilde{\theta }}}{2}=\cos \frac{\theta }{2}\cos \frac{\theta '}{2}+{\mathbf {n}}\cdot {\mathbf {n}}'\sin \frac{\theta }{2}\sin \frac{\theta '}{2}. \end{aligned}$$
Besides, for the quaternions q and $q'$, respectively, associated with the rotations $R(\theta ,{\mathbf {n}})$ and $R(\theta ',{\mathbf {n}}')$, we have:
$$\begin{aligned} q'\cdot Qq'=(q\cdot q')^2-\frac{1}{4}=\cos ^2\frac{{\tilde{\theta }}}{2}-\frac{1}{4}=\frac{1}{4}(2\cos {\tilde{\theta }}+1), \end{aligned}$$
where Q is the normalised uniaxial Q-tensor:
$$\begin{aligned} Q=q\otimes q-\frac{1}{4}I_4. \end{aligned}$$
Finally, $\frac{1}{2}R(\theta ,{\mathbf {n}})\cdot R(\theta ',{\mathbf {n}}')=q'\cdot Qq'$ and we obtain thanks to the previous point:
$$\begin{aligned} \phi \Big (R(\theta ,{\mathbf {n}})\Big )=Q, \end{aligned}$$
that is to say: if $J\in SO_3({\mathbb {R}})$, then $\phi (J)$ is a normalised uniaxial Q-tensor.
4.
If $D={{\,\mathrm{diag}\,}}(d_1,d_2,d_3)$, then using the explicit form of $\phi $ given in the second point:
$$\begin{aligned} \phi (D)=\frac{1}{4}\left( \begin{array}{cccc} d_1+d_2+d_3 &{} &{} &{} \\ &{} d_1-d_2-d_3 &{} &{} \\ &{} &{} -d_1+d_2-d_3 &{} \\ &{} &{} &{} -d_1-d_2+d_3 \end{array}\right) , \end{aligned}$$
and if $Q={{\,\mathrm{diag}\,}}(s_1,s_2,s_3,s_4)$ with $s_1+s_2+s_3+s_4=0$, then
$$\begin{aligned} \phi ^{-1}(Q)=2\left( \begin{array}{ccc} {s_1+s_2} &{} &{} \\ &{} {s_1+s_3} &{} \\ &{} &{} {s_1+s_4} \end{array} \right) . \end{aligned}$$

$\square $

More About $SO_3\pmb {{\mathbb {(R)}}}$ and $SO_n\pmb {{\mathbb {(R)}}}$

Lemma A.1

For all $n\ge 3$:

$$\begin{aligned} {{\,\mathrm{Span}\,}}(SO_n({\mathbb {R}}))={\mathscr {M}}_n({\mathbb {R}}). \end{aligned}$$

Proof

First we prove that the diagonal matrices form a subset of ${{\,\mathrm{Span}\,}}(SO_n({\mathbb {R}}))$: it is enough to show that

$$\begin{aligned} D{:}=\left( \begin{array}{cccc}1 &{} &{} &{} \\ &{} 0 &{} &{} \\ &{} &{} \ddots &{} \\ &{} &{} &{} 0 \end{array}\right) \in {{\,\mathrm{Span}\,}}(SO_n({\mathbb {R}})) \end{aligned}$$

(the other diagonal matrices with only one nonzero coefficient can be obtained in a similar way). When n is odd:

$$\begin{aligned} D=I_n + \left( \begin{array}{cccc}1 &{} &{} &{} \\ &{} -1 &{} &{} \\ &{} &{} \ddots &{} \\ &{} &{} &{} -1 \end{array}\right) \end{aligned}$$

and both matrices in the sum are in $SO_n({\mathbb {R}})$. When $n\ge 4$ is even,

$$\begin{aligned} \left( \begin{array}{cc}0_4 &{} \\ &{} I_{n-4} \end{array}\right) =\frac{1}{2} I_n +\frac{1}{2}\left( \begin{array}{cc}-I_4 &{} \\ &{} I_{n-4} \end{array}\right) \in {{\,\mathrm{Span}\,}}(SO_n({\mathbb {R}})), \end{aligned}$$

thus:

$$\begin{aligned} \left( \begin{array}{ccccc}1 &{} &{} &{} &{} \\ &{} -1 &{} &{} &{} \\ &{} &{} -1 &{} &{} \\ &{} &{} &{} 1 &{} \\ &{} &{} &{} &{} 0_{n-4} \end{array}\right) =\left( \begin{array}{ccccc}1 &{} &{} &{} &{} \\ &{} -1 &{} &{} &{} \\ &{} &{} -1 &{} &{} \\ &{} &{} &{} 1 &{} \\ &{} &{} &{} &{} I_{n-4} \end{array}\right) -\left( \begin{array}{cc}0_4 &{} \\ &{} I_{n-4} \end{array}\right) \in {{\,\mathrm{Span}\,}}(SO_n({\mathbb {R}})), \end{aligned}$$

and similarly:

$$\begin{aligned} 4D= & {} \left( \begin{array}{ccccc}1 &{} &{} &{} &{} \\ &{} 1 &{} &{} &{} \\ &{} &{} 1 &{} &{} \\ &{} &{} &{} 1 &{} \\ &{} &{} &{} &{} 0_{n-4} \end{array}\right) +\left( \begin{array}{ccccc}1 &{} &{} &{} &{} \\ &{} -1 &{} &{} &{} \\ &{} &{} -1 &{} &{} \\ &{} &{} &{} 1 &{} \\ &{} &{} &{} &{} 0_{n-4} \end{array}\right) +\left( \begin{array}{ccccc}1 &{} &{} &{} &{} \\ &{} 1 &{} &{} &{} \\ &{} &{} -1 &{} &{} \\ &{} &{} &{} -1 &{} \\ &{} &{} &{} &{} 0_{n-4} \end{array}\right) \\&+\left( \begin{array}{ccccc}1 &{} &{} &{} &{} \\ &{} -1 &{} &{} &{} \\ &{} &{} 1 &{} &{} \\ &{} &{} &{} -1 &{} \\ &{} &{} &{} &{} 0_{n-4} \end{array}\right) \in {{\,\mathrm{Span}\,}}(SO_n({\mathbb {R}})). \end{aligned}$$

The SSVD (Definition 3.2) gives the result for any matrix. $\square $

Corollary B.1

For $n\ge 3$, a matrix that commutes with any matrix of $SO_n({\mathbb {R}})$ is of the form $\lambda I_n$.

We can now prove Lemma 3.3 and its generalisation Lemma 3.4.

Proof of Lemma 3.3

The linear map $\Phi : {\mathscr {M}}_n({\mathbb {R}})\rightarrow {\mathscr {M}}_n({\mathbb {R}})$ defined by

$$\begin{aligned} \Phi (J):=\int _{SO_n({\mathbb {R}})} (J\cdot A)A\,\mathrm{d}A, \end{aligned}$$

satisfies for all $P\in SO_n({\mathbb {R}})$,

$$\begin{aligned} \Phi (P)=P\Phi (I_n)=\Phi (I_n)P, \end{aligned}$$

which by Corollary B.1 means that

$$\begin{aligned} \Phi (I_n)=\lambda I_n. \end{aligned}$$

Therefore, $\Phi (P)=\lambda P$ for any $P\in SO_n({\mathbb {R}})$ and the same is true for any matrix $J\in {\mathscr {M}}_n({\mathbb {R}})$ by Lemma A.1. To compute $\lambda $, notice that for the matrix $e_i\otimes e_j$:

$$\begin{aligned} \frac{1}{2}\int _{SO_n({\mathbb {R}})} (e_i\cdot Ae_j)^2\,\mathrm{d}A=\lambda . \end{aligned}$$

Summing this equality for all i, j gives:

$$\begin{aligned} \frac{n}{2}=\lambda n^2, \end{aligned}$$

and $\lambda =1/2n$. $\square $

Proof of Lemma 3.4

Let us define the linear map $\psi : {\mathscr {M}}_n({\mathbb {R}})\longrightarrow {\mathscr {M}}_n({\mathbb {R}})$ by

$$\begin{aligned} \psi (J):=\int _{SO_n({\mathbb {R}})} (J\cdot A)A\,g(A)\,\mathrm{d}A. \end{aligned}$$

(47)

1.
We first note that $\psi $ is self-adjoint for the dot product $A\cdot B={{\,\mathrm{Tr}\,}}(A^\mathrm{T}B)$: for any $K\in {\mathscr {M}}_n(R)$,
$$\begin{aligned} \psi (J)\cdot K =\int _{SO_n({\mathbb {R}})} (J\cdot A)(K\cdot A)\,g(A)\,\mathrm{d}A=J\cdot \psi (K). \end{aligned}$$
2.
We prove that ${{\,\mathrm{Span}\,}}(I_n)$ is a stable subspace for $\psi $: for any $P\in SO_n({\mathbb {R}})$ we have:
$$\begin{aligned} P\psi (I_n) P^\mathrm{T}=\int _{SO_n({\mathbb {R}})} (I_n\cdot A) PAP^\mathrm{T}\,g(A)\,\mathrm{d}A=\psi (I_n), \end{aligned}$$
and we conclude with Corollary B.1 that $\psi (I_n)=\alpha I_n$ with:
$$\begin{aligned} \alpha =\frac{2}{n} I_n\cdot \psi (I_n)=\frac{1}{2n}\int _{SO_n({\mathbb {R}})} {{\,\mathrm{Tr}\,}}(A)^2 g(A)\,\mathrm{d}A. \end{aligned}$$
3.
Since $\psi $ is a self adjoint operator, the orthogonal subspace ${{\,\mathrm{Span}\,}}(I_n)^\perp $ is also a stable subspace. Moreover, using the change of variable $A\mapsto A^\mathrm{T}$, we see that $\psi (J^\mathrm{T})=\psi (J)^\mathrm{T}$ and we have the decomposition:
$$\begin{aligned} {{\,\mathrm{Span}\,}}(I_n)^\perp ={\mathscr {S}}_n^0({\mathbb {R}})\overset{\perp }{\oplus }{\mathscr {A}}_n({\mathbb {R}}), \end{aligned}$$
where ${\mathscr {S}}_n^0({\mathbb {R}})$ and ${\mathscr {A}}_n({\mathbb {R}})$ are, respectively, the subspace of trace free symmetric matrices and the subspace of skew-symmetric matrices. They are both stable subspaces.
4.
We prove now that $\psi : {\mathscr {S}}_n^0({\mathbb {R}})\rightarrow {\mathscr {S}}_n^0({\mathbb {R}})$ is a uniform scaling. By the spectral theorem, every matrix $J\in {\mathscr {S}}_n^0({\mathbb {R}})$ can be written
$$\begin{aligned} J=PDP^\mathrm{T}, \end{aligned}$$
where $P\in SO_n({\mathbb {R}})$ and D is diagonal. Since
$$\begin{aligned} \psi (PDP^\mathrm{T})=P\psi (D)P^\mathrm{T}, \end{aligned}$$
it is enough to prove that there exists $\lambda \in {\mathbb {R}}$ such that for all diagonal matrices D, $\psi (D)=\lambda D$. For $D={{\,\mathrm{diag}\,}}(d_1,\ldots ,d_n)$, we have:
$$\begin{aligned} \psi (D)=\frac{1}{2}\sum _{k=1}^{n} d_k \int _{SO_{n}({\mathbb {R}})} a_{kk} A\,g(A)\,\mathrm{d}A. \end{aligned}$$
Now, if $i\ne j$, let $D^{ik}\in SO_n({\mathbb {R}})$ be the diagonal matrix such that all the coefficients are equal to 1 except $D^{ik}_{ii}$ and $D^{ik}_{kk}$ which are equal to $-1$. Then, the change of variable $A\mapsto D^{ik}A(D^{ik})^\mathrm{T}$ gives
$$\begin{aligned} \int _{SO_{n}({\mathbb {R}})} a_{kk} a_{ij}\,g(A)\,\mathrm{d}A = -\int _{SO_{n}({\mathbb {R}})} a_{kk}\,a_{ij}\,g(A)\,\mathrm{d}A =0, \end{aligned}$$
which proves that $\psi (D)$ is diagonal and the ith coefficient of $\psi (D)$ is:
$$\begin{aligned} \psi (D)_{ii}=\frac{1}{2}\sum _{k=1}^{n} d_k \int _{SO_{n}({\mathbb {R}})} a_{kk}\,a_{ii}\,g(A)\,\mathrm{d}A. \end{aligned}$$
Using the fact that ${{\,\mathrm{Tr}\,}}(D)=0$ and that all the $\int _{SO_n({\mathbb {R}})} a_{ii}\,a_{kk}\,g(A)\,\mathrm{d}A$ are equal for $i\ne k$ (by using conjugation by the matrices $P^{ij}$, $i,j\ne k$, see Definition 3.1), we obtain:
$$\begin{aligned} \psi (D)_{ii}=\frac{1}{2}d_i \int _{SO_{n}({\mathbb {R}})} (a_{11}^2-a_{11}\,a_{22}) \,g(A)\,\mathrm{d}A, \end{aligned}$$
and we conclude that for all $J\in {\mathscr {S}}_n^0({\mathbb {R}})$:
$$\begin{aligned} \psi (J)=\lambda J, \end{aligned}$$
with
$$\begin{aligned} \lambda =\frac{1}{2}\int _{SO_{n}({\mathbb {R}})} (a_{11}^2-a_{11}\,a_{22}) \,g(A)\,\mathrm{d}A=\frac{1}{4}\int _{SO_n({\mathbb {R}})} (a_{11}-a_{22})^2\,g(A)\,\mathrm{d}A. \end{aligned}$$
5.
We prove similarly that $\psi : {\mathscr {A}}_n({\mathbb {R}})\rightarrow {\mathscr {A}}_n({\mathbb {R}})$ is a uniform scaling. Every $J\in {\mathscr {A}}_n({\mathbb {R}})$ can be written
$$\begin{aligned} J=P CP^\mathrm{T}, \end{aligned}$$
where
$$\begin{aligned} C=\left( \begin{array}{ccccccc} C_1 &{} &{} &{} &{} &{} &{} \\ &{} C_3 &{} &{} &{} &{} &{} \\ &{} &{} \ddots &{} &{} &{} &{} \\ &{} &{} &{} C_{2p-1} &{} &{} &{} \\ &{} &{} &{} &{} 0 &{} &{} \\ &{} &{} &{} &{} &{} \ddots &{} \\ &{} &{} &{} &{} &{} &{} 0 \end{array}\right) \end{aligned}$$
(48)
is a block diagonal matrix with blocks
$$\begin{aligned} C_i=\left( \begin{array}{cc} 0 &{} -c_i \\ c_i &{} 0 \end{array}\right) ,\,\,\,\,c_i\in {\mathbb {R}}^*,\,\,\,\,\,i\in \{1,3,5,\ldots ,2p-1\} \end{aligned}$$
so that,
$$\begin{aligned} \psi (J)=\psi (PCP^\mathrm{T})=P\psi (C)P^\mathrm{T}, \end{aligned}$$
and
$$\begin{aligned} \psi (C)=\frac{1}{2}\sum _{k=1}^{p} c_{2k-1} \int _{SO_n({\mathbb {R}})} (a_{2k,2k-1}-a_{2k-1,2k})\,A\,g(A)\,\mathrm{d}A. \end{aligned}$$
When $n\ge 3$, by using conjugation by the matrices $D^{2j-1,2j}$ for $j\in \{1,\ldots ,\left\lfloor n/2\right\rfloor \}$ (Definition 3.1) we can see that for each $k\in \{1,\ldots ,p\}$, the matrix
$$\begin{aligned} M_k:=\int _{SO_n({\mathbb {R}})} (a_{2k,2k-1}-a_{2k-1,2k})\,A\,g(A)\,\mathrm{d}A \end{aligned}$$
is of the form (48). Moreover, when $n\ge 5$, by using conjugation by matrices $D^{2j-1,2\ell -1}$ for $j\ne \ell $, $j,\ell \in \{1,\ldots ,\left\lfloor n/2\right\rfloor \}$ and $j,\ell \ne k$, we can see that all the diagonal blocks of $M_k$ are equal to zero except the one in position $2k-1$. When $n=3$ there is only one block in position 1 so the result holds but when $n=4$ such j and $\ell $ do not exist. In conclusion, when $n\ne 4$, $\psi (C)$ is of the form (48) and each diagonal block $C_{2k-1}'$ of $\psi (C)$ is written
$$\begin{aligned} C_{2k-1}'=\mu _{2k-1}C_{2k-1} \end{aligned}$$
with
$$\begin{aligned} \mu _{2k-1}:= & {} \int _{SO_n({\mathbb {R}})} (a_{2k,2k-1}-a_{2k-1,2k})a_{2k,2k-1}g(A)\,\mathrm{d}A\\ {}= & {} \int _{SO_n({\mathbb {R}})} (a_{21}-a_{12})a_{21}g(A)\,\mathrm{d}A, \end{aligned}$$
where this equality follows by using conjugation by “block-permutation matrices”:
$$\begin{aligned} Q^k:= \left( \begin{array}{ccccccccccc} I_2 &{} &{} &{} &{} &{} &{} &{} &{} &{} &{} \\ &{} \ddots &{} &{} &{} &{} &{} &{} &{} &{} &{} \\ &{} &{} I_2 &{} &{} &{} &{} &{} &{} &{} &{} \\ &{} &{} &{} 0_2 &{} -I_2 &{} &{} &{} &{} &{} &{} \\ &{} &{} &{} I_2 &{} 0_2 &{} &{} &{} &{} &{} &{} \\ &{} &{} &{} &{} &{} I_2 &{} &{} &{} &{} &{} \\ &{} &{} &{} &{} &{} &{} \ddots &{} &{} &{} &{} \\ &{} &{} &{} &{} &{} &{} &{} I_2 &{} &{} &{} \\ &{} &{} &{} &{} &{} &{} &{} &{} 1 &{} &{} \\ &{} &{} &{} &{} &{} &{} &{} &{} &{} \ddots &{} \\ &{} &{} &{} &{} &{} &{} &{} &{} &{} &{} 1 \end{array}\right) , \end{aligned}$$
where the first zero on the diagonal is in position $2k-1$. Therefore,
$$\begin{aligned} \psi (C)=\mu C, \end{aligned}$$
(49)
with
$$\begin{aligned} \mu =\frac{1}{2}\int _{SO_n({\mathbb {R}})}(a_{21}-a_{12})a_{21}\,g(A)\,\mathrm{d}A=\frac{1}{4}\int _{SO_n({\mathbb {R}})} (a_{21}-a_{12})^2\,g(A)\,\mathrm{d}A. \end{aligned}$$
(50)
6.
Finally, for $J\in {{\,\mathrm{Span}\,}}(I_n)^\perp $, writing
$$\begin{aligned} J=\frac{J+J^\mathrm{T}}{2}+\frac{J-J^\mathrm{T}}{2}, \end{aligned}$$
we have:
$$\begin{aligned} \psi (J)= \beta J+\gamma J^\mathrm{T}, \end{aligned}$$
with
$$\begin{aligned} \beta =\frac{1}{2}(\lambda +\mu )=\frac{1}{4}\int _{SO_n({\mathbb {R}})} \Big ((a_{11}^2-a_{11}\,a_{22})+(a_{21}-a_{12})a_{21}\Big )\,g(A)\,\mathrm{d}A, \end{aligned}$$
and
$$\begin{aligned} \gamma =\frac{1}{2}(\lambda -\mu )=\frac{1}{4}\int _{SO_n({\mathbb {R}})} \Big ((a_{11}^2-a_{11}\,a_{22})-(a_{21}-a_{12})a_{21}\Big )\,g(A)\,\mathrm{d}A. \end{aligned}$$
7.
In conclusion, writing the decomposition
$$\begin{aligned} J=\frac{1}{n}{{\,\mathrm{Tr}\,}}(J)I_n+K, \end{aligned}$$
where $K\in {{\,\mathrm{Span}\,}}(I_n)^\perp $, we obtain
$$\begin{aligned} \psi (J)=a{{\,\mathrm{Tr}\,}}(J) I_n+bJ+cJ^\mathrm{T}, \end{aligned}$$
with
$$\begin{aligned} a=\frac{\alpha -\beta -\gamma }{n}, \end{aligned}$$
and
$$\begin{aligned} b=\beta =\frac{1}{8}\int _{SO_n({\mathbb {R}})}\Big ((a_{11}-a_{22})^2+(a_{12}-a_{21})^2\Big )\,g(A)\,\mathrm{d}A, \end{aligned}$$
and
$$\begin{aligned} c=\gamma =\frac{1}{8}\int _{SO_n({\mathbb {R}})}\Big ((a_{11}-a_{22})^2-(a_{12}-a_{21})^2\Big )\,g(A)\,\mathrm{d}A. \end{aligned}$$
And there are of course many other ways to write the coefficients a, b and c.

$\square $

Remark B.1

In dimension 4, the result still holds for symmetric matrices. For general matrices, the result can be proved in particular cases, for instance when g is a function of the trace, by using an explicit parametrisation of $SO_4({\mathbb {R}})$ such as the 4-dimensional version of (9).

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Degond, P., Diez, A., Frouvelle, A. et al. Phase Transitions and Macroscopic Limits in a BGK Model of Body-Attitude Coordination. J Nonlinear Sci 30, 2671–2736 (2020). https://doi.org/10.1007/s00332-020-09632-x

Download citation

Received: 02 April 2019
Accepted: 04 May 2020
Published: 30 May 2020
Issue Date: December 2020
DOI: https://doi.org/10.1007/s00332-020-09632-x

Keywords

Mathematics Subject Classification

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Phase Transitions and Macroscopic Limits in a BGK Model of Body-Attitude Coordination

Abstract

Similar content being viewed by others

Body-Attitude Alignment: First Order Phase Transition, Link with Rodlike Polymers Through Quaternions, and Stability

MaxEP and Stable Configurations in Fluid–Solid Interactions

A de Broglie–Bohm Model of Pure Shape Dynamics: N-Body system

1 Introduction

Theorem 1

Theorem 2

2 The BGK Equation and Other Related Models of Self-organisation

2.1 A Review of the Different IBM

Theorem 3

Theorem 4

2.2 A Review of the Different Kinetic Equations

3 Preliminaries: Structure and Calculus in \(SO_n({\mathbb {R}})\)

3.1 Structure and Haar Measure on \(SO_n({\mathbb {R}})\)

Lemma 3.1

Remark 3.1

Definition 3.1

Lemma 3.2

Proof

Lemma 3.3

Lemma 3.4

3.2 Volume Forms in \(SO_3\pmb {{\mathbb {(R)}}}\)

3.3 Singular Value Decomposition (SVD)

Proposition 3.1

Definition 3.2

Remark 3.2

4 Equilibria of the BGK Operator

4.1 Characterisation of the Equilibria and Compatibility Equations

Lemma 4.1

Remark 4.1

Proof

Remark 4.2

Theorem 5

Remark 4.3

Remark 4.4

Remark 4.5

Proposition 4.1

Proof

Corollary 4.1

Remark 4.6

Remark 4.7

Proposition 4.2

Proof of Theorem 5

Remark 4.8

4.2 Proof of Proposition 4.2

Theorem 6

Proposition 4.3

Proof of Proposition 4.2

4.3 Determination of the Equilibria for Each Density \(\rho \)

Proposition 4.4

Proof

Corollary 4.2

5 Convergence to Equilibria

Proposition 5.1

Theorem 7

Remark 5.1

Remark 5.2

5.1 A Gradient-Flow Structure in \(\pmb {{\mathbb {(R)}}}^3\)

5.1.1 Reduction to a Nonlinear ODE in \({\mathbb {R}}^3\) and Equilibria

Proposition 5.2

Proof

Proposition 5.3

Proof

Corollary 5.1

Proof

Proposition 5.4

Proof

Remark 5.3

Proposition 5.5

5.1.2 A Gradient-Flow Structure

Lemma 5.1

Proof

Remark 5.4

5.2 Stability of the Equilibria and Conclusion

Theorem 8

Lemma 5.2

Proof

Proof of Theorem 8