Abstract
Numerically solving the Boltzmann equation is computationally expensive in part due to the number of variables the distribution function depends upon. Another contributor to the complexity of the Boltzmann Equation is the quadratic collision operator describing changes in the distribution function due to colliding particle pairs. Solving it as efficiently as possible has been a topic of recent research, e.g. Cai and Torrilhon (Phys Fluids 31(12):126105, 2019. https://doi.org/10.1063/1.5127114), Wang and Cai (J Comput Phys 397:108815, 2019. https://doi.org/10.1016/j.jcp.2019.07.014), Cai et al. (Comput Fluids 200:104456, 2020. https://doi.org/10.1016/j.compfluid.2020.104456). In this paper we exploit results from representation theory to find a very efficient algorithm both in terms of memory and computational time for the evaluation of the quadratic collision operator. With this novel approach we are also able to provide a meaningful interpretation of its structure.
Similar content being viewed by others
1 Introduction
Solving the Boltzmann equation numerically is an area of ongoing research. There are already existing results based on Spectral-Fourier Approach by Pareschi and Russo [1] and Gamba and Tharkabhushanam [2], with recent work improving some of its shortcomings such as [3] on the preservation of conservations or [4] on the stability of such methods. Another ansatz is based on the spectral Hermite ansatz introduced by Grad (1949) [5] with early work [6] and more recent work such as [7, 8] on linearising the collision operator. Wang and Cai were recently [9] able to calculate the bilinear collision operator based on the spectral Hermite ansatz. However, their method is computationally expensive. In response, Cai et al. [10] developed and implemented a more efficient algorithm based on the spectral Burnett ansatz. This ansatz has also been looked into analytically [11] and numerically [12]. Inspired by Struchtrup’s (2005) tensorial approach to the collision operator in Equation 6.35 in [13], we chose a different approach for calculating the Spectral-Burnett approximation of the Boltzmann collision operator. The main difference lies in the exploitation of features of the basis set, which is adapted to the irreducible subspaces with respect to the orthogonal group of the polynomial space as it consists of real solid spherical harmonics multiplied with Laguerre polynomials. While we also implemented the mostly analytical calculation of collision coefficients for a variety of potentials, the focus of this paper is the algorithm and corresponding numerical code, which uses these coefficients to solve the space-homogeneous Boltzmann equation for any distribution. Utilizing the properties of the irreducible subspaces, compared to the previous works we are able to significantly reduce the memory and computation time required for calculating the numerical solution. We could achieve this by virtue of the uniqueness up to a constant of linear maps between the irreducible subspaces. Representation theory allowed us to isolate each bilinear map, the constant for each map that encodes particle collisions and identify all bilinear maps, that evaluate to zero due to the mathematical properties of the irreducible subspaces in question. Our method provides a deep understanding of the underlying structure of the bilinear collision operator, allows for a very flexible setup for numerical computations and therefore is ideal for a purposeful reduction of computational effort.
The structure of this paper is as follows: The next section provides an overview of the collision operator and the potentials used in this paper. Sections 3 and 4 describe the application of representation theory to the collision operator, while Sect. 5 explains the representation theoretically inspired algorithm and its implementation. In Sect. 6 we apply our algorithm and also compare the results with [9]. For an overview of the applied representation theory refer to the “Appendix”.
2 Boltzmann’s Collision Operator
To describe gas flows, the Boltzmann equation is known to be valid for a wide range of situations. In particular, it describes non-equilibrium gas flows accurately for Knudsen numbers in the transition regime between the range of Navier–Stokes–Fourier equations and the kinetic regime. The Boltzmann equation describes the evolution of the gas distribution function f, which in general depends on seven variables—time t, position space \(\textbf{x}\) and velocity space \(\textbf{c}\).
As done in previous papers we look at the space-homogenous Boltzmann equation to be able to focus solely on the collision operator:
where f gives the time-dependent velocity distribution of particles,
and \(\mathcal {S}(f,f)\) is the bilinear collision operator, see e.g. the textbooks [14, 15], modelling the binary collisions of the particles in the gas:
where \(\textbf{n}\) is the unit vector orthogonal to the collisional plane, the relative velocity \(\textbf{g}=\textbf{c}-\mathbf {c_1}\), the average velocity \(\textbf{h}=\frac{1}{2}(\textbf{c}+\mathbf {c_1})\), and the primed velocities \(\textbf{c}', \textbf{c}_1'\), \(\mathbf {g'}=\mathbf {c'}-\mathbf {c_1'}\) with \(||\mathbf {g'}||=||\textbf{g}||\), \(\mathbf {h'}=\textbf{h}\) are the velocities after the collision. Additionally, we assume the initial average velocity of the gas to be 0, and as this is one of the conserved moments it will always be 0. The solution of (1) converges to a Gaussian centered around the average velocity. Our assumption will allow us to exactly represent this solution with our numerical ansatz.
As discussed in several textbooks, [14, 15], they are given by
where \(\mathbf {g'}\) is the result of rotating \(\textbf{g}\) around \(\textbf{n}\) by the angle \(\chi \). According to Cercignani’s work [14], the kernel \(B(|\textbf{g}|,\chi )\) depends on the assumed potential between the colliding particles. Following [14, 15], different choices are possible and common models [7,8,9,10] include
-
The hard spheres model, i.e.
$$\begin{aligned} B(|\textbf{g}|,\chi )&= |\textbf{g}| \frac{d^2}{4}&\text {with }&d \text { a constant for the diameter of the colliding particles}. \end{aligned}$$(5) -
Its generalization given by the variational hard spheres (VHS) model,
$$\begin{aligned} B(|\textbf{g}|,\chi )&= |\textbf{g}|^{\frac{\eta -5}{\eta -1}} \frac{d^2}{4}. \end{aligned}$$(6) -
The inverse power law potentials (IPL), i.e. with \(\eta >3\) determining the power law exponent
$$\begin{aligned} B(|\textbf{g}|,\chi )&= |\textbf{g}|^{\frac{\eta -5}{\eta -1}} W_0 \frac{\,\text {d}W_0}{\,\text {d}\chi }{} & {} \text {with}&\chi&= \pi - 2 \int _0^{W_1} \left( 1-W^2 -\frac{2}{\eta -1}\left( \frac{W}{W_0}\right) ^{\eta -1}\right) ^{-\frac{1}{2}}\,\text {d}W \end{aligned}$$(7)$$\begin{aligned} \text {and } W_1&\text { the positive root of }{} & {} {} & {} 1-W_1^2 -\frac{2}{\eta -1}\left( \frac{W_1}{W_0}\right) ^{\eta -1}. \end{aligned}$$(8)We use the change of variables given in [7] to evaluate \(W_0(\chi )\).
We identify two types of assumed potentials corresponding to whether or not B depends on the angle \(\chi \). Additionally, \(\eta \) can be chosen from \((3,\infty )\). Note that the IPL potential models can be derived from assuming a specific potential between the colliding particles, whereas the VHS model only allows that for \(\eta =\infty \), in which case the two models correspond to each other. In this sense, the power potentials can be seen as more realistic models and the VHS models as a simplified version.
In this paper we will work with different examples of these models and compare them. In particular, we work with the classical \(\chi \)-dependent Maxwell Potential with \(\eta =5\), the \(\chi \)-independent and hence simplified Maxwell potential, as well as the hard power potential with \(\eta =10\). As we will explore in Sect. 4.2.2, these choices reflect different levels of computational difficulty both in a priori calculations as well as in the numerical evaluation.
3 Irreducible Burnett Ansatz
3.1 Traditional Setup
3.1.1 Expansion of the Distribution Function
To solve the space homogeneous Boltzmann equation numerically, we follow [16] to expand the distribution function with an orthonormal basis set \(\Upsilon _\alpha (\textbf{c})\),
where \(\alpha \) is typically a multi-index in three indices determining the basis polynomial \(\Psi _\alpha \), \( f^{(\text {eq})}(\textbf{c})\) is the global equilibrium or Maxwellian function and the scalar product is chosen for the weighted function space \(W_{\text {eq}}:L^2(\mathbb {R}^3,\mathbb {R}, \frac{1}{f^{(\text {eq})}} \,\text {d}{\textbf{c}})\), see e.g. [16]:
to allow us to identify the weights \(w_\alpha \) as moments of the distribution function. Here, we shift the velocity-coordinates such that the medium velocity is zero and rescale the variables so that the temperature is 1, giving us
Generalizing this approach, the distribution function \(f(t,\textbf{c})\) is approximated as a time-dependent polynomial in \(\textbf{c}\) of degree \(M\in \mathbb {N}\) or less, multiplied by the global Maxwellian,
implicitly choosing the approximation space for the distribution function \(f(t,\textbf{c})\in V_{\text {eq}}\) with
where \(\mathbb {R}[c_x,c_y,c_z]\) is the space of all real polynomials in three variables and \(N=\frac{1}{6}(M+1)(M+2)(M+3)\) the dimension of \(P^M\). We can equip the polynomial space with a scalar product corresponding to the setup (10):
3.1.2 Expansion of the Collision Operator
Inserting the expansion of the distribution function (9) into the space homogeneous Boltzmann equation (1) and using the linearity of the bilinear collision operator yields:
We now project this expression onto \(V_{\text {eq}}\) spanned by orthonormal basis functions \( \Upsilon _{\alpha }\). The left hand side leads to
while the right hand side evaluates to
Overall, we find that the rate of change of the distribution function’s weights depends on the collision vector \(s_{\alpha }\),
The collision vector in turn is determined by the bilinear operation (15) between the weights \(w_\alpha \) of the distribution function and a basis-dependent collision tensor
3.2 Structure of the Approximation Space
The remaining choice are the N basis polynomials \(\Psi _\alpha (\textbf{c})\), with which we span \(P^M\), which has great influence on the structure and sparsity of the collision tensor and hence on the required computational time and memory to obtain \(s_\alpha \).
For example Grad [5] or more recently Cai [9] choose Hermite polynomials in three coordinates, since these are known to be orthogonal with respect to the corresponding scalar product \(\langle \cdot ,\cdot \rangle _P\). However, structure and sparsity of the resulting collision tensor can still be improved.
We want to use a basis that exploits a key feature of the collision operator: It describes a physical process in the three dimensional velocity space, requiring it to be, like all physical processes, compatible with isometric transformations of the coordinate system, i.e. with \(\mathscr {O}_3\) the group of orthogonal matrices from \(\mathbb {R}^{3\times 3}\):
While this identity is often accepted, we included a proof in the “Appendix” for completeness, as this statement is central for this paper. This restriction allows us to employ results from the representation theory of \(\mathscr {O}_3\) and reduce the bilinear maps with which we express \(s_\alpha \) to only those bilinear maps between the \(\mathscr {O}_3\)-irreducible subspaces of \(P^M\).
3.2.1 Decomposition of \(P^M\) into Its Irreducible Subspaces
With the space of homogeneous and harmonic polynomials of degree \(n\in \{0,1,\cdots , M\}\) and dimension \(2n+1\)
we have the building blocks for the irreducible subspaces of \(P^M\) [17]. To make it clear when an index describes the degree of a harmonic polynomial, we call n the anisotropic index. We also define the one-dimensional space of homogeneous isotropic polynomials of second degree
and denote with \(c^2\) its element where \(\lambda =1\). Ignoring the chosen scalar product from Eq. (13) for now allows us to decompose \(P^M\) into a direct sum of homogeneous polynomials with respect to some corresponding scalar product,
We can check that this decomposition is complete by counting dimensions:
Figure 1 sorts these subspaces column-wise according to their homogeneous degree \(\text {deg}=2a+n\) and row-wise according to their anisotropic index n. Subspaces with same anisotropic index n are isomorphic to each other and the decomposition is not unique within each set of isomorphic subspaces. Subspaces with different anisotropic indices are non-isomorphic to each other and therefore necessarily pairwise orthogonal with respect to any \(\mathscr {O}_3\) invariant scalar product, see Remark 9.5 in this paper’s “Appendix”.
Now we come back to the chosen scalar product in (13). The isomorphic subspaces shown in Fig. 1 are not orthogonal with respect to \(\langle \cdot ,\cdot \rangle _P\). We therefore orthogonalize each set of isomorphic subspaces with a linear combination of suitable powers of \((c^2)\) determined by associated Laguerre polynomials:
where \({^{a}\mathscr {H}^{n}}\) are the orthogonalized irreducible subspaces. We call a the isotropic index and are left with the basis choice for \(\mathscr {H}^{n}\).
3.2.2 Basis Choice Adapted to the Irreducible Subspaces
Here we employ the real-valued solid spherical harmonics \(Y_n^l: \mathbb {R}^3 \rightarrow \mathbb {R}\) of n-th degree with the directional index \(l\in \{-n, -n+1, \cdots , n-1, n\}\), related to the real spherical harmonics \(\mathscr {Y}_n^l: S^2 \rightarrow \mathbb {R}\) by extension \(S^2 \rightarrow \mathbb {R}^3\) via
Note, that we can draw parallels to indices in quantum mechanics where the spherical harmonics are complex, and the principal, azimuthal and magnetic quantum numbers have similar mathematical roles to the isotropic index a, anisotropic index n and directional index l, respectively.
This leads to the following choice for \(\Psi _\alpha \):
with \((a,n,l)\in \mathbb {N}_0 \times \mathbb {N}_0 \times \{-n,\cdots ,n\}\) specifying the multi-index \(\alpha \), and the normalization coefficient given by
3.3 Discussion of the Basis Set and Irreducibility
Instead of immediately choosing a basis set, we first looked at the irreducible subspaces of \(P^M\) with respect to \(\mathscr {O}_3\) and collected them as spaces of homogeneous polynomials in Fig. 1. Upon finding that we have multiple isomorphic subspaces, all row-wise collected in Fig. 1, we linearly combined them via Laguerre polynomials to obtain \({^{a}\mathscr {H}^{n}}\), the orthogonalized irreducible subspaces. Then we chose basis polynomials for each \({^{a}\mathscr {H}^{n}}\), but again using the isomorphism between spaces within one row, it was enough to choose a set of basis polynomials for each lowest degree irreducible subspace per row, i.e. for the spaces \(\mathscr {H}^{n}\) of harmonic polynomials of degree n.
We have chosen real solid spherical harmonics, since they are easily accessible, orthogonal and adapted to the chain of subgroups \(\mathscr {O}_3 \ge \mathscr {O}_2 \ge \mathscr {O}_1\) with \(c_z\) the special symmetry direction, i.e. for an \(\mathscr {O}_2\)-invariant problem rotated so that \(c_z\) is the rotation axis for \(\mathscr {O}_2\), all coefficients with \(l\ne 0\) are zero.
The basis set containing all basis polynomials with \(2a +n \le M\) spans the full space \(P^M\). Note, that only basis polynomials with \(n=0\) are rotationally invariant. However, similar to the decomposition of a natural number into its prime factors, this basis is adapted to the decomposition of \(P^M\) into its smallest invariant subspaces. Here “smallest” is defined as an invariant subspace U, that cannot be further decomposed into invariant subspaces other than U itself and the zero space, similar to a prime number only being devisable by itself and 1. These subspaces are called irreducible. Figure 1 shows the decomposition of \(P^M\) into its homogeneous irreducible subspaces, while we use orthogonalized subspaces with respect to \(\langle \cdot ,\cdot \rangle _P\) given for a pair of chosen isotropic index a and anisotropic index n by
with a dimension of \(2n+1\).
Note that we have recovered the spectral Burnett ansatz with real spherical harmonics. First results with the spectral Burnett basis have been developed previously in [10,11,12, 18], but we are the first to exploit the underlying structure of the collision operator in our implementation. We also implement the nearly fully analytic calculation of the tensor coefficients and carry out numerical tests for large M.
4 Implications of Representation Theory
4.1 Decomposing the Collision Operator
Equation (15) is the orthogonal projection of \(\mathcal {S}(f,f)\) onto the approximation space \(V_{\text {eq}}\). Equivalently we can identify the bilinear map \(\mathcal {C}: P^{M} \times P^{M} \rightarrow P^{M}\) with coefficients given by the collision tensor in terms of the basis functions of \(P^{M}\), now defined as
i.e. with \(p,q \in P^{M}\) and their respective expansion coefficients \(w_{anl},u_{anl}\) we can express the map \(\mathcal {C}\) via
Not only is \(\mathcal {C}\) a bilinear map, it inherits the property of the collision operator \(\mathcal {S}\) that it is linear with respect to \(\mathscr {O}_3\), compare to Eq. (18):
Such a map is called natural. This restriction has been studied in representation theory and implies a decomposition of \(\mathcal {C}\) into a sum over isotropic and anisotropic indices
with \(S_{abc}^{n\hat{n}\tilde{n}}\in \mathbb {R}\) some constants that, given a set of maps \(\Phi _{abc}^{n\hat{n}\tilde{n}}\), distinguish between different maps \(\mathcal {C}\). Each \(\Phi _{abc}^{n\hat{n}\tilde{n}}: P^M \times P^M \rightarrow P^M\) is a non-zero natural bilinear map given by
with orthogonal projections \(\mathscr {P}_{(n,a)}: P^M \rightarrow {^{a}\mathscr {H}^{n}}\), isomorphisms \(\psi _{(n,a)}: {^{a}\mathscr {H}^{n}} \rightarrow \mathscr {H}^n\), embedding \(\mathscr {E}_{(n,a)}: {^{a}\mathscr {H}^{n}}\rightarrow P^M\), and the universal natural bilinear map
This map \(Q^{n\hat{n}\tilde{n}}\) is either zero or unique up to a constant, depending on the combination of anisotropic indices:
Note, that a decomposition of a (multi-) linear map between vector spaces into a sum of maps between subspaces as done in Eq. (29) by itself is generally possible. Equation (28) is crucial for the decomposition of \(\mathcal {C}\) in (29), requiring \(\Phi _{abc}^{n\hat{n}\tilde{n}}\) and \(Q^{n\hat{n}\tilde{n}}\) to be natural as well, and ultimately restricting all possible bilinear and natural maps \(\mathscr {H}^{\hat{n}} \times \mathscr {H}^{\tilde{n}} \rightarrow \mathscr {H}^n\) to be the same up to a constant with the additional condition (32).
4.2 Decoding the Structure of the Collision Tensor
Overall, the previous result implies the decomposition of the collision tensor into two tensors:
mirroring the structure of irreducible subspaces of \(P^M\). \(Q_{l\hat{l}\tilde{l}}^{n \hat{n} \tilde{n}}\) and \(\mathcal {S}_{n{\hat{n}}\tilde{n}}^{abc}\) have two distinct roles for the overall collision tensor. The coupling tensor \(Q_{l\hat{l}\tilde{l}}^{n \hat{n} \tilde{n}}\) collects the coefficients of the map \(Q^{n\hat{n}\tilde{n}}\). It is independent of the potential and encodes the purely mathematical coupling of \(Y_{\hat{n}}^{\hat{l}}\) and \(Y_{\tilde{n}}^{\tilde{l}}\) to \(Y_n^l\). The impact tensor \(\mathcal {S}_{n{\hat{n}}\tilde{n}}^{abc}\) encodes the physics of the particle collisions.
4.2.1 The Coupling Tensor
The coupling tensor \(Q_{l\hat{l}\tilde{l}}^{n \hat{n} \tilde{n}}\) encodes the natural bilinear map, unique up to a constant factor, from two harmonic polynomial spaces of degrees \(\hat{n}\) and \(\tilde{n}\) to a third harmonic polynomial space of degree n, see Eq. (31). Crucially, two natural bilinear maps between different, but isomorphic irreducible subspaces are identical up to a constant factor and hence all can be expressed in terms of one \(Q^{n \hat{n} \tilde{n}}\). Referring to Fig. 1 and knowing that all subspaces of the same row are isomorphic to each other, we can also picture \(Q^{n \hat{n} \tilde{n}}\) as the bilinear map between three rows. This universality is the key in our algorithm, allowing us to achieve high efficiency both in terms of computational time and memory usage. We can calculate the values of the coupling tensor for example with
The coupling tensor is related to the Clebsch–Gordon-Coefficients [19]. While the Clebsch–Gordon-Coefficients are more common in quantum physics, where they are usually used for three harmonic spaces of complex polynomials, the coupling tensor is the equivalent for real harmonic polynomials. It shares the property that it is zero for most combinations of anisotropic and directional indices \((n,\hat{n},\tilde{n},l,\hat{l},\tilde{l})\), but does not have as many restrictions as its complex counterpart. In particular, the conditions in Eq. (32) hold for \(Q_{l\hat{l}\tilde{l}}^{n \hat{n}\tilde{n}}\), and if they are met, then \(Q_{000}^{n \hat{n}\tilde{n}}\) is unequal to zero. Keeping in mind the uniqueness up to a constant of the coupling tensor for each triple of anisotropic indices \((\hat{n},\tilde{n},n)\), we choose one instance of \(Q^{n \hat{n} \tilde{n}}\) for each triple \((n, \hat{n}, \tilde{n})\), calculate the values of the coupling tensor once and hard-code them in the implementation as function
.
4.2.2 The Impact Tensor
In contrast to \(Q_{l\hat{l}\tilde{l}}^{n \hat{n} \tilde{n}}\), the impact tensor \(\mathcal {S}_{n{\hat{n}}\tilde{n}}^{abc}\) contains all information about the physics of the collision. While from the mathematical point of view the constant factor distinguishing two maps \(Q^{n \hat{n} \tilde{n}}\) with same anisotropic indices \((n, \hat{n}, \tilde{n})\) may seem irrelevant, it is clear that in the physical application these factors are the main interest. Here the conservation rules come into play as zeroes for particular combinations of anisotropic and isotropic indices \((n,{\hat{n}},\tilde{n},a,b,c)\), but this is also where different potentials lead to different values in the coefficients of the impact tensor. Notice, that the impact tensor is independent of the directional indices \((l,\hat{l},\tilde{l})\), so with our chosen map \(Q^{n \hat{n} \tilde{n}}\) we can compute its coefficients with
for combinations of anisotropic indices \((n,{\hat{n}},\tilde{n})\) fulfilling the conditions in (32). Depending on the potential, we can solve the integration in (35) fully or almost fully analytically with a computer algebra system, see the git-repository [20] for further details.
Referring to Table 1, we can now see where each chosen potential falls on the spectrum of high and low computational difficulty of both a priori evaluation, i.e. the calculation of the impact tensor itself, and runtime evaluation of the collision vector.
Note, that there is some relation between these two types of difficulty. The a priori evaluation only determines a set of numbers that make up the impact tensor, whereas the runtime evaluation mostly depends on the number of non-zero impact tensor entries and not on the specific non-zero entry \(\mathcal {S}_{n{\hat{n}}\tilde{n}}^{abc}\). As already noted by Grad [5], for the potentials with \(\eta =5\) the additional condition
applies, which stems from the orthogonality of the basis functions and the independence of the kernel from \(\textbf{c}\) and \(\mathbf {c_1}\). This significantly reduces the amount of non-zero impact tensor coefficients, leading to comparatively low computational difficulty both a priori and during runtime.
The a priori computational difficulty of the impact tensor coefficients additionally is determined by the \(\chi \)-dependency of the kernel B. All \(\chi \)-independent models are of lower difficulty, with the simplified Maxwell model the lowest difficulty due to (36). On the other side of the spectrum lies the hard power potential with \(\eta =10\), since it is \(\chi \)-dependent and condition (36) does not apply, requiring the evaluation of the full set of \(\mathcal {S}_{n{\hat{n}}\tilde{n}}^{abc}\). This potential also has high runtime computational difficulty, since almost all impact tensor coefficients are non-zero.
4.3 Full Generalization of the Collision Operator
Notice, that the decomposition of the collision tensor into the impact and coupling tensor not only implies a particular implementation to calculate the collision vector, but it also gives us a unique understanding of the role of each coefficient. In particular, we can now generalize the collision operator to any symmetric, bilinear map between real polynomials in three variables with physical meaning, i.e. invariance under transformation of the coordinate system: Such a map would not necessarily follow the conservation laws but still follow the decomposition into coupling and impact tensor. To obtain an upper limit for the computational cost of calculating the collision vector, we can assign the impact tensor random numbers for all combinations of anisotropic and isotropic indices that fulfil Eq. (32). When testing the performance with this setup, we shall call such a potential general.
5 Implementation
The task is to implement the tensor-vector multiplication between the expansion coefficients of the collision tensor in the form of \({S}_{n{\hat{n}}\tilde{n}}^{abc} Q_{l\hat{l}\tilde{l}}^{n \hat{n} \tilde{n}}\) and the expansion coefficients \(w_{anl}\) of the distribution function. The result is saved as a vector with expansion coefficients \(s_{anl}\), so we want to find an efficient algorithm to compute the collision vector s, i.e.
5.1 Custom Implementation Adapted to the Given Tensor Structure
Looking at the calculation task in (37), we can separate it into operations that need to be carried out for all combinations of anisotropic and isotropic indices. Beginning by setting the collision vector s to zero, we can rewrite the task as
Note, how the impact tensor is now outside of the sum, only remaining as a constant factor.
5.1.1 Implementing the Coupling Tensor
In fact, we can now isolate the map between the irreducible subspaces of different anisotropic and isotropic indices in terms of an operation that only involves the coupling tensors and the directional indices:
Here the anisotropic indices are only relevant to indicate the degree of the harmonic subspaces and hence the length of the involved vectors. Note, that we changed the positions of the indices and font of the coefficients of the distribution function and collision vector to indicate that this is an operation on a small subsection of the overall vectors.
Apart from the constant factor determined by the impact tensor, this operation is independent of the isotropic indices (a, b, c), so we can give it as a general operation between three generic vectors \(\varsigma ^{n},u^{\hat{n}},v^{\tilde{n}}\) of lengths \(2n+1, 2\hat{n}+1,2\tilde{n}+1\), respectively:
We obtain Eq. (39) by replacing \(\beta \) with the corresponding value of \(\mathcal {S}_{n{\hat{n}}\tilde{n}}^{abc}\).
Since \(Q_{l\hat{l}\tilde{l}}^{n \hat{n} \tilde{n}}\) are constants, the above operation (40) can be hard-coded without loss of generality of the overall implementation. We do so by giving each triple of anisotropic indices \((n, \hat{n}, \tilde{n})\) a unique case numer q and saving the expansion coefficients of both the distribution function \(w_{anl}\) and the collision vector \(s_{anl}\) in a specific order, so that we seperate it into coherent \(2n+1\) long sections, or sub-vectors, \(\text {w}^{an}\). We also order the \(w_{anl}\) such that the directional index l starts at \(-n\) and ends at n, see Fig. 2. This reflects the splitting of the polynomial space into its irreducible subspaces: Each section \(\text {w}^{an}\) contains the coordinates of w in one particular irreducible subspace characterized by the anisotropic and isotropic indices a and n. The position of each section \(\text {w}^{an}\) is given by a corresponding pointer \(p_{an}\), which locates the first of the \(2n+1\) entries. The collision vector is ordered similarly.
Furthermore, we know that the following symmetry relations must hold:
Therefore, we used Mathematica to generate the C++ function
that implements Eq. (40) for all combinations of anisotropic indices \((n, \hat{n}, \tilde{n})\) with \(\hat{n} \le \tilde{n}\). Input for this method is \(\beta \) as double
, q as integer
and the three vectors \(\xi ^{n},\zeta ^{\hat{n}},\varsigma ^{\tilde{n}}\) as three double pointers
,
,
. The method switches between all relevant triplets of anisotropic indices \((n, \hat{n}, \tilde{n})\), where q indicates the triplet and is thus used in the case distinction. For each case C++-code corresponding to the operation in (40) was generated. Here the three pointers are used as starting positions for the three vectors.
Note, that this code does not check, whether the three vectors actually have been allocated, all input is assumed to be valid in order for the computation to be as time-efficient as possible. Furthermore, it is always assumed that \(\hat{n}\le \tilde{n}\), the input vectors should be ordered accordingly (Fig. 3).
5.1.2 Implementing the Impact Tensor
In contrast to Q, the coefficients \(\mathcal {S}_{n{\hat{n}}\tilde{n}}^{abc}\) can vary depending on the potential. We therefore give the impact tensor in a text file as input for the code. Each line of this text file is an input list containing the necessary information for
, compare to Eq. (40) and see Fig. 4:
To calculate the collision vector, i.e. to compute (38), the C++-code iterates through each line of the impact tensor file and applies
with the case q, \(\beta =\mathcal {S}_{n{\hat{n}}\tilde{n}}^{abc}\) and the three vectors determined by \(p_{an}, p_{b\hat{n}}, p_{c\tilde{n}}\). Utilizing (41), we can shorten the input list by stating twice the value of \(\mathcal {S}_{n{\hat{n}}\tilde{n}}^{abc}\) when \((b,\hat{n}) \ne (c,\tilde{n} )\), leaving the line with \(\hat{n}<\tilde{n}\) and omitting the corresponding line of \(\mathcal {S}_{n\tilde{n}{\hat{n}}}^{acb}\).
5.1.3 Discussion of the Implementation
Effectively, we have now implemented the map \(\Phi _{abc}^{n\hat{n}\tilde{n}}\) from Eq. (30): Structuring the array of moments \(w_{anl}\) into subsections corresponding to the irreducible subspaces and using the positions given by the pointers \(p_{b\hat{n}}\) and \(p_{c\tilde{n}}\) as starting positions for the operation
corresponds to \(\phi _{(\hat{n},b)}\circ \mathscr {P}_{(\hat{n},b)}\) and \(\phi _{(\tilde{n},c)}\circ \mathscr {P}_{(\tilde{n},c)}\).
itself is the implementation of the map \( Q^{n\hat{n}\tilde{n}}\), only that it operates on the section of the vector \(s_{anl}\) indicated by the pointer \(p_{an}\), implementing \(\mathscr {E}_{(n,a)} \circ \phi ^{-1}_{(n,a)}\).
This setup is very flexible. It is easily adapted to different potentials, theories and initial conditions, i.e. the initial occupancy of \(w_{anl}\) of the distribution function. Omitting lines belonging to irrelevant combinations of \((a,b,c,n,{\hat{n}},\tilde{n})\) for the given setup, we can build calculation-specific input files for the collision operator. In Table 2 we have listed the required memory space for the impact tensor and the average calculation time of one time step for a selection of different potentials and theories. All computation is done sequentially on a CPU model Intel(R) Core(TM) i7-8650U. We note that the hard power potential with \(\eta =10\) is very close to the general potential in terms of computational requirements. In Table 3 we have listed the number of coupling coefficients for different M and the corresponding sizes of the compiled program. We note, that the memory requirement for the compiled program is much larger than that for the text file of the impact tensor.
5.2 Comparison to an Algorithm for General Tensor-Vector Multiplications and Current State of the Art
Given that the ultimate goal of this work is to produce an implementation of the evaluation of \(\mathcal {S}(f,f)\) within a numerical code, that is as fast as possible, the question is, whether the above described implementation is better than an algorithm developed specifically for the computation of tensor-vector multiplications. The efficient implementation of tensor operations is a field of on-going research in high performance computing and currently there are many different options available.
To check how the algorithm we developed compares to other algorithms, we chose the Tensor Algebra Compiler (TACO) [21] as a case study, as it provides code generation in C++ and seems to perform similarly well to other libraries. Here we found that TACO performed best when given the full collision tensor \(C_{\alpha \beta \gamma }\) with every triple of indices (a, n, l) mapped to one counter index. We found the computational times comparable but slightly slower than our implementation. We also compare memory requirements for the full tensor \(C_{\alpha \beta \gamma }\) with the irreducible Burnett approach and found that for a general potential, the full tensor \(C_{\alpha \beta \gamma }\) is much more demanding than what is needed for the impact tensor and
together. However, when measuring the memory requirements for the Maxwell potential, we initially found that the memory requirements for the full tensor \(C_{\alpha \beta \gamma }\) are less than the requirements for
. This is because saving
as compiled program is more memory intensive than a file with only numbers, and because we still compared to the general coupling tensor which has many more cases than actually necessary—for \(M=20\) out of the 1331 possible cases only 505 are actually required for the Maxwell potential, leading to a significantly smaller program of only 6.6MB. This reduction is due to the additional condition on the indices (36).
Assessing the performance of the irreducible Burnett algorithm would be incomplete without looking at the performance of the Burnett spectral method in [10] obtained from a single thread computation on a CPU model Intel Xeon E5-2697A V4. In Table 4 we can compare the three algorithms for the Maxwell potential. Additionally, for the hard Maxwell potential with \(\eta =10\) and \(M=M_0=20\), we find from Figure 15 in [10], that one evaluation of the collision operator takes about 90ms, whereas our algorithm takes about 50ms. Note however, that a different computational setup with different CPU models was used for each computation. We therefore conclude, that the computational time of both methods is similar for the IPL potential with \(\eta =10\), while the memory cost is greatly reduced with the irreducible Burnett method, which requires about 51MB, whereas the spectral Burnett method needs 784MB for just the sparse Maxwell potential. Table 4 also allows us to compare performance for the Maxwell potential of both methods and here we can see a significant difference both in computational time and memory usage.
Overall, we are satisfied to conclude that our implementation utilizes the underlying structure of the collision operator well enough to outperform the current state of the art.
5.3 Comparison to the Spectral Burnett Method in [10]
The spectral Burnett method used in the ansatz in [10] is based on complex solid harmonics. Here the presented algorithm exploits the relation for the directional indices—the collision tensor’s entries can only be non-zero if \(l=\hat{l}+\tilde{l}\) holds. By contrast, the irreducible Burnett ansatz uses real solid harmonics, for which this condition does not apply. Instead, we find a sparse coupling tensor with band structures, since the real solid harmonics are a simple linear combination of their complex counterparts. Crucially, the main difference between the two approaches lies in the decomposition of the collision tensor into the impact and coupling tensor. The algorithm of the irreducible Burnett ansatz exploits both this decomposition and the sparsity of the coupling tensor. It is possible, that the performance can still be improved by applying the decomposition to the basis with complex spherical harmonics.
5.4 Comparison to the Spectral Fourier Approach
The irreducible and the spectral Burnett ansatz share a similar approach to discretize the velocity space through moments. This method is based on picking a finite polynomial degree and using a finite-dimensional approximation space for the velocity-dependency of the distribution function. In contrast, the spectral Fourier approach discretizes the velocity space by choosing some finite 3-dimensional velocity domain and representing that with a grid of equally distant points. The collision operator is a weighted convolution. [2] and [10] compared its computational costs to the presented algorithm and found comparable performance. Fixing the velocity domain introduces problems in the form of aliasing effects, choice of cut off and conservation properties. On the other hand, moment methods have difficulty in approximating some distributions, an example for this will be demonstrated in Sect. 6.3.
Notably, the spectral Fourier approach is connected to the representation theory of abelian groups, such as \(\mathscr {O}_2\) or the group of translations. In fact, the functions \(e^{2\pi i \theta _k}\) used for the expansion each span a one-dimensional irreducible subspace [22].
5.5 Using the Irreducible Burnett Ansatz
As we will see in the numerical application, not all initial conditions are computable in the Burnett basis. However, moment equations allow a relatively low number of numerical degrees of freedom to describe physically relevant situations. Moments as variables encode physical quantities, usually those of low order are the most interesting, allowing us to interpret a subset of the numbers representing the distribution function in a meaningful way.
Moment equations come with a level of complexity, and the irreducible Burnett ansatz is no exception. Its implementation is challenging compared to other numerical methods for the Boltzmann equation such as direct simulation monte carlo methods [23] or the aforementioned spectral Fourier approaches. This is due to the low level C++ code of the coupling tensor, which needs to be generated using scripts or similar for a high polynomial degree M due to the large number of different cases. We generated the function
with Mathematica, and while the generating code can be checked, the overall code of
cannot be verified by a human, only tested for plausibility in a small number of cases. However, this paper must be seen in the context of moment approximations for the Boltzmann equation, see e.g. [13, 24, 25]. In this context, via one way or the other, the collision operator will always be implemented as some matrix–vector product in the linear case, or a tensor-vector-vector product in the bilinear case. While the implementation of such a (bi-) linear operation can be verified by a human, for large degree M the matrix or tensor entries themselves similarly cannot be checked due to their number.
Solving just the spatially homogeneous Boltzmann equation is incomplete, the overall goal is to provide a method, with which the collision operator can be evaluated efficiently. However, the application of the irreducible Burnett ansatz to the inhomogeneous Boltzmann equation is out of the scope of this paper and we refer to [26] for further details.
6 Numerical Application
The irreducible Burnett ansatz is the real pendant to the spectral Burnett ansatz, additionally mathematically decomposing the collision tensor. While the spectral Burnett ansatz in [10] uses complex solid harmonics, the distribution function is real for their method as well. Therefore we expect its properties to be inherited from the spectral Burnett ansatz, including the rate of convergence. All differences between results of both methods must stem from purely numerical effects resulting from computational accuracy. This section aims to ensure the correct implementation of the method. To do so, we will roughly estimate spectral convergence by comparing lower orders of approximation to the numerical solution with \(M=20\) for all cases and times. Additionally we compare to the analytical solution where possible.
Let f denote the exact distribution, \(\hat{f}\) the numerical solution with degree \(M=20\) and moments \(\hat{w}\), \(\tilde{f}\) any other numerical solution with smaller degree and moments \(\tilde{w}\), \(\Delta f:= \hat{f}-\tilde{f}\) and their difference. Using the \(L^2\)-norm implied by the scalar product on \(V_\text {eq}\) from Eq. (10),
\(\Vert {\Delta f} \Vert \) is simply given by the standard vector norm of the difference between their moments,
Unlike the exact solution f, which we can only access in the first test case, \(\Delta f\) is available for all times. It is also a good indicator for spectral convergence, as we find for the residual
Note that for \(t=0\), \(\hat{f}\) is the result of the orthogonal projection of f and therefore \(f-\hat{f}\) is in the orthogonal complement of \(V_{\text {eq}}\), implying that \(\Delta f \perp (f-\hat{f})\). Therefore, at \(t=0\)
holds, so when \(\Vert {\Delta f} \Vert \) decreases, so does \(\Vert {f-\tilde{f}} \Vert \). We will calculate \(\Vert {f-\hat{f}} \Vert \) at \(t=0\) to find the difference’s order of magnitude, but show plots of \(\Vert {\Delta f} \Vert \) for both \(t=0\) and \(t\ne 0\), as we are mostly interested in the evolution of the coefficients calculated by the irreducible Burnett ansatz.
We consider three initial distributions to demonstrate the range of numerical application of our method:
-
BKW-Solution, where the initial function can be visualized as a 3-dimensional hollow sphere.
-
Bi-Gaussian distribution, where the initial function consists of two overlapping Gaussians.
-
Discontinuous distribution, where the initial function consists of two different states that are seperated at a flat plane in velocity space and that haven’t interacted with each other yet.
We will look both at the effect of the choice of potential, as well as the difference between various numerical approximations, mainly given by the choice of different maximal polynomial degrees M. The equilibrium solution to the Boltzmann equation, the Gaussian distribution, lies in the chosen approximation space \(V_\text {eq}\) for any M. With the our basis the Gaussian distribution corresponds to the state \(w_{000}=1\) and all other moments equal to zero. Note, that all presented initial distributions evolve towards the Gaussian distribution with time for all presented potentials. Therefore, for large times we expect no difference between any two solutions, regardless of their initial distribution. This also implies that the residual between any two solutions decreases with increasing time. As a result, the focus should be on early time scales, because differences show up in early times and decrease with large time.
Additionally, conservation of mass, momentum and energy imply constraints on the first coefficients, making them irrelevant for further consideration:
6.1 BKW Solution
The BKW solution is one of the few analytical solutions to the homogeneous Boltzmann Equation with Maxwell-potential and has been originally developed by Krupp (1967); Bobylev et al. (1976) [27]. As done in [9], we use the dimensionless formulation from Ernst [27]:
where
The initial distribution \(f(0,\textbf{c})\) can be envisioned as a 3D hollow sphere. In particular, it is obvious, that this distribution is rotationally invariant, allowing us to plot it in cylindrical coordinates without loosing information. Since our basis is adapted to \(\mathscr {O}_3\), we can reduce the degrees of freedom to only those coefficients that correspond to rotationally invariant basis functions, i.e. only weights \(w_{anl}\) with \((a,n,l) = (a,0,0)\) can be non-zero (Fig. 5).
Choosing a maximal degree of \(M=20\), we only have 11 relevant coefficients:
Combined with the restraints from Eq. (42), this allows us to take a look at the remaining nine time-dependent moments, all of higher order with \(w_{200}\) the lowest one, to judge how well the analytical and numerical solutions match without loosing any information.
Spectral Convergence of the solution for the BKW distribution. The y-Axis shows \(\Vert {\Delta f} \Vert \), i.e. the \(L^2\)-Norm given by the weighted scalar product of the difference between the projection of the analytical solution onto the approximation space with \(M=20\) and the numerical solution with different maximal polynomial degrees M, the x-Axis shows this degree. The plots are shown for times between 0 (top graph) and 1 (bottom graph) in steps of 0.1 (in order from top to bottom) and the evolution is calculated with the Maxwell potential
Given that Cai et al. observed very good correspondence in [9, 10], and that the approximations differ in basis choice at most, we expect to obtain similar results. Looking at Fig. 6, we observe that this expectation is met. We can also see spectral convergence of our method in Fig. 7 by comparing results of lower polynomial degrees
to the projection of the analytical solution onto the approximation space with \(M=20\) for different times \(t\in [0, 1]\). We observe faster than exponential convergence, which is demonstrated by the downward sloping curves on the semi-log scale. However, there is no reason to expect that this behaviour translates from the very special BKW test case to the approximation of a general distribution.
Note, that for various M the difference between the evolution of the lowest order time-dependent moment \(w_{200}\), which all compared theories share, is in the order of \(10^{-14}\). This is due to the restriction in Eq. (36) imposed by the Maxwell potential, which for this particular case simplifies to
With \(a=2\), the only possible contributions can come from \((b,c)\in \left\{ (0,2),(1,1)\right\} \), which are already taken into account by the lowest order ansatz with \(M=4\). A corresponding argument applies to any higher moment—in this specific test case the increase of polynomial degree cannot have an influence on the lower moments, but it does add accuracy to the overall distribution, as shown in Fig. 7.
While the difference in basis choice and understanding of the underlying representation theory seems benign, in this particular case it means we have reduced the roughly 1700 perceived degrees of freedom to just 9. This also affects the number of relevant impact coefficients: For a rotationally invariant distribution and \(M=20\) we only need 34 impact coefficients \(\mathcal {S}_{n{\hat{n}}\tilde{n}}^{abc}\) and hence the evaluation time for one time step is only a few ms.
Figure 8 compares the evolution of the moments for different potentials. By omitting any line in the impact tensor file where \(b\ne 0\), we generated the linear collision operator. In Fig. 8a we can clearly see a difference between the linear Maxwell potential and the analytical solution, in particular in the higher moments. We also calculated the evolution of the coefficients for the simplified Maxwell model and found that it corresponds well to the analytical solution. Figure 8b compares the residual between the moments obtained from the numerical solution with \(M=20\) and the moments given by projecting the analytical solution onto the approximation space \(V_{\text {eq}}\) with \(M=20\) for the three different potentials. We can see that both the Maxwell and the simplified Maxwell potential produce very accurate moments with error in the order of \(10^{-13}\), while the linear potential differs significantly from the analytical solution. The plot also shows \(\Vert {f-\hat{f}} \Vert \), i.e. the difference between the analytical and numerical solution obtained with the Maxwell potential for the ansatz with \(M=20\) in the full function space equipped with the same scalar product as \(V_{\text {eq}}\). The residual starts at 8.9e\(-\)4 for \(t=0\) and we can see it steadily decreasing with time, as the solutions converge to the same Gaussian.
We conclude, that for rotationally invariant cases, the simplified Maxwell model produces similar results as the exact Maxwell model, and linearising the collision operator introduces a noticeable error. We also see even in this simple example the limit of spectral Burnett ansatz—moments can be accurately calculated, but the actual distribution function can only be approximated with some error depending on the degree M.
We also calculated the evolution of the coefficients for the simplified Maxwellian model and found that it corresponds to the analytical solution as well. The linearised simplified Maxwell model differs to the analytical solution similarly to the linearised Maxwell model. We conclude, that for rotationally invariant cases, the simplified Maxwell model is as good as the exact Maxwell model, and linearising the collision operator introduces a small but noticeable error (Fig. 9).
6.2 Bi-Gaussian Distribution
Again we take the initial distribution from [9]
Due to the distribution’s mirror symmetry w.r.t the x, y-plane, only even anisotropic indices n can result in non-zero projection coefficients in our basis. This is reflected in the projection of a monomial. Using the computer algebra system Mathematica, the projection of a monomial leads to a product from which it is easy to find a corresponding condition—any uneven degree leads to 0:
with some additional function \(K_1(a,b,c)\). We further see, that the distribution is \(\mathscr {O}_2\)-invariant for rotations around the z-axis, resulting in the additional constraint that unless the directional index \(l=0\), the coefficients are zero. Overall, with \(M=20\) we find 66 relevant coefficients from combinations of
We note, that with all the coefficients of uneven anisotropic indices being zero in the beginning and due to the conditions (32), we again may drop many lines from the impact tensor file, albeit not as many as in the BKW case. Figure 11 shows the residual between the ansatz with \(M=20\) and lower order approximations with the maximal polynomial degrees \(M\in \left\{ 10,12,14,16,18\right\} \).
Here we again see spectral convergence of the solution, indicating that this test case can also be well approximated by the spectral Burnett basis. Figure 10 indicates that the smallest and largest orders \(M=10\) and \(M=20\) both have similar looking distributions, with their largest difference located at the tips and overlap of the two Gaussians. We can further see from both Figs. 10 and 11 that the difference between approximations decreases with time as discussed in the beginning of this chapter. \(\Vert {f-\hat{f}} \Vert \) at \(t=0\) evaluates to 1.6e\(-\)3, the same order of magnitude as the first case. Overall, this indicates that the initial distribution is well approximated.
Spectral convergence of the solution for the bi-Gaussian distribution. The y-Axis shows \(\Vert {\Delta f} \Vert \), i.e. the \(L^2\)-Norm given by the weighted scalar product of the difference between the two solutions of different M and \(M=20\) as indicated by the x-Axis. The plots are shown for times between 0 (top graph) and 1 (bottom graph) in steps of 0.1 (in order from top to bottom) and the evolution is calculated with the Maxwell potential
Lastly, we compare the simplified and actual Maxwell potential in Fig. 12, where we look at the evolution of 19 out of the 66 coefficients as examples to demonstrate the range of possibilities. We note that for some moments the Maxwell and simplified Maxwell potential correspond to each other well, while for other moments those calculated with the Maxwell potential converge faster to 0. This trend is observed for most of the not displayed coefficients as well. However, we also note that some coefficients grow first, before converging to zero. Here we see, that the coefficients of the Maxwell potential remain closer to zero than those of the simplified Maxwell potential. Overall, we conclude that the simplified Maxwell potential can lead to a significantly different solution than the Maxwell potential in setups that are not rotationally invariant.
6.3 Discontinuous Distribution
Using the dimensionless formulation from [9] for this case we have
Projection of a monomial, evaluated by Mathematica, again leads to a product from which we can infer that certain index combinations lead to zero:
again with some additional function \(K_2(a,b,c)\) However, finding exclusion rules for the projection coefficients is less obvious here. They can only be non-zero if n is uneven or zero. We again find that the distribution function is \(\mathscr {O}_2\)-symmetric, requiring the projection coefficients to be zero unless \(l=0\). Note in contrast to the bi-Gaussian case, that even though at the beginning the relevant coefficients have an uneven or zero anisotropic index, two uneven anisotropic indices can couple to an even anisotropic index, see (32). So the total set of relevant coefficients \(w_{anl}\) for \(M=20\) is given by:
i.e. one coefficient per irreducible subspace, which makes 121 coefficients in total for \(M=20\). However, because all combinations of isotropic and anisotropic indices are possible, this case requires the full impact tensor and basis set, which has 1771 moments. Due to the \(\mathscr {O}_2\)-symmetry, most of them are zero and will always be zero for this test case. Looking at Fig. 13, we can clearly see that the discontinuity at \(t=0\) is not well approximated, even with a large degree of \(M=20\). This is reflected in \(\Vert {f-\hat{f}} \Vert \), which at \(t=0\) evaluates to 1.7e\(-\)1, a much larger value than in the other two cases.
Comparing the evolution of the distribution function in Fig. 14 for approximations of degree \(M=10\) and \(M=20\), we again find that the two approximations get closer to each other with time, as both converge to the same Gaussian.
Convergence of the solution for the discontinuous distribution. The y-Axis shows \(\Vert {\Delta f} \Vert \), i.e. the \(L^2\)-Norm given by the weighted scalar product of the difference between the two solutions of different \(M=20\) and M as indicated by the x-Axis. The plots are shown for times between 0 (top graph) and 1 (bottom graph) in steps of 0.1 (in order from top to bottom) and the evolution is calculated with the Maxwell potential
Discontinuous distribution: Evolution of the stress tensor entries \(\sigma _{11}\) and \(\sigma _{22}\) and the heat flux entry \(q_1\) for the hard IPL potential with \(\eta =10\). The results of the irreducible Burnett ansatz (irreducible, dots) are compared to those from [9] (reference, solid lines), with values kindly provided by the authors. Note, that we transformed our coordinates to match the orientation of the reference values for this plot
Figure 15 shows the residual between the ansatz with \(M=20\) and lower order approximations, each denoted by their maximal polynomial degree
Here we see a rather slow spectral convergence and that there is little gain of accuracy when increasing M from an uneven to an even degree, which is also why \(M=19\) was omitted. We propose that this is due to the initial condition, in which most of the coefficients with even \(2a+n\) are zero. Together with the badly approximated discontinuity shown in Fig. 13, this shows one of the limits of the approximation space. This is the first test case where all lines of the impact tensor file are contributing to the overall evolution of the distribution function and as with the previous example we can see clear differences in the evolution of coefficients between the simplified Maxwell and Maxwell potential in Fig. 16.
When looking at the heat flux and stress tensor, we find good correspondence to previously published results in [9] in Fig. 17, with an absolute difference in the order of \(10^{-6}\). Note, that the approximation space for this comparison is not the same. Referring to the method demonstrated in [9], our ansatz corresponds to \(M_0 = M = 20\), while the values provided by the authors have been calculated with \(M_0=15, M=60\). The good correspondence suggests that this difference has little influence on the moments of this low order and that the irreducible Burnett ansatz produces results very similar to the spectral Burnett ansatz. This should be the case, since the difference between the spectral and the irreducible Burnett ansatz lies solely in the exploitation of the collision tensor’s decomposition.
7 Summary
With the irreducible Burnett ansatz we were able to reduce the computational effort of calculating Boltzmann’s collision operator in the spectral Burnett ansatz both in computational time and memory by implementing an algorithm inspired by the representation theoretical decomposition of the collision operator. This decomposition relies on the fact that the used basis is adapted to the irreducible subspaces of the approximation space.
This decomposition not only allowed us to find an efficient algorithm, but also provides an understanding behind the structure of the collision tensor by separating factors that depend on the underlying physics from those that are of pure mathematical nature: The bilinear collision tensor can be expressed as product of the impact tensor with coefficients depending on the potential between the colliding particles and the coupling tensor with coefficients only depending on the basis choice of the irreducible subspaces.
With spherical harmonics as basis, further structure of the coupling tensor could be exploited by looking at \(\mathscr {O}_2\)-irreducible subspaces. For example, the coupling tensor could be split into the case where all anisotropic indices are zero and everything else. This is not necessarily generally applicable, but there may exist interesting use-cases.
Applying the method of this paper to collision operators of different kind, such as polyatomic gases [28] or the Landau operator [29], looks like a promising next step.
Data availibility
Enquiries about data availability should be directed to the authors.
References
Pareschi, L., Russo, G.: Numerical solution of the Boltzmann equation I: spectrally accurate approximation of the collision operator. SIAM J. Numer. Anal. 37(4), 1217–1245 (2000). https://doi.org/10.1137/S0036142998343300
Gamba, I.M., Tharkabhushanam, S.H.: Spectral-Lagrangian methods for collisional models of non-equilibrium statistical states. J. Comput. Phys. 228(6), 2012–2036 (2009)
Pareschi, L., Rey, T.: Moment preserving Fourier–Galerkin spectral methods and application to the Boltzmann equation. SIAM J. Numer. Anal. (2021). https://doi.org/10.48550/ARXIV.2105.13158
Hu, J., Qi, K., Yang, T.: A new stability and convergence proof of the Fourier–Galerkin spectral method for the spatially homogeneous Boltzmann equation. SIAM J. Numer. Anal. 59(2), 613–633 (2021). https://doi.org/10.1137/20M1351813
Grad, H.: On the kinetic theory of rarefied gases. Commun. Pure Appl. Math. 2(4), 331–407 (1949). https://doi.org/10.1002/cpa.3160020403
Sod, G.A.: Numerical solution of Boltzmann’s equation (1976). https://doi.org/10.2172/7184312
Cai, Z., Torrilhon, M.: Approximation of the linearized Boltzmann collision operator for hard-sphere and inverse-power-law models. J. Comput. Phys. 295, 617–643 (2015)
Cai, Z., Torrilhon, M.: Numerical simulation of microflows using moment methods with linearized collision operator. J. Sci. Comput. 74(1), 336–374 (2018)
Wang, Y., Cai, Z.: Approximation of the Boltzmann collision operator based on hermite spectral method. J. Comput. Phys. 397, 108815 (2019). https://doi.org/10.1016/j.jcp.2019.07.014
Cai, Z., Fan, Y., Wang, Y.: Burnett spectral method for the spatially homogeneous Boltzmann equation. Comput. Fluids 200, 104456 (2020). https://doi.org/10.1016/j.compfluid.2020.104456
Kumar, K.: Polynomial expansions in kinetic theory of gases. Ann. Phys. 37(1), 113–141 (1966). https://doi.org/10.1016/0003-4916(66)90280-6
Gamba, I.M., Rjasanow, S.: Galerkin–Petrov approach for the Boltzmann equation. J. Comput. Phys. 366, 341–365 (2018)
Struchtrup, H.: Grad’s moment method. In: Struchtrup, H. (ed.) Macroscopic Transport Equations for Rarefied Gas Flows: Approximation Methods in Kinetic Theory, pp. 87–107. Berlin, Springer (2005). https://doi.org/10.1007/3-540-32386-4_6
Cercignani, C.: The Boltzmann equation. In: Cercignani, C. (ed.) The Boltzmann Equation and Its Applications, pp. 40–103. New York, Springer (1988). https://doi.org/10.1007/978-1-4612-1039-9_2
Kremer, G.M.: Basic principles of the kinetic theory. In: Kremer, G.M. (ed.) An Introduction to the Boltzmann Equation and Transport Processes in Gases, pp. 1–35. Springer, Berlin (2010). https://doi.org/10.1007/978-3-642-11696-4_1
Cai, Z., Torrilhon, M.: On the Holway–Weiss debate: convergence of the Grad-moment-expansion in kinetic gas theory. Phys. Fluids 31(12), 126105 (2019). https://doi.org/10.1063/1.5127114
Applequist, J.: Traceless cartesian tensor forms for spherical harmonic functions: new theorems and applications to electrostatics of dielectric media. J. Phys. A Math. Gen. 22(20), 4303–4330 (1989). https://doi.org/10.1088/0305-4470/22/20/011
Burnett, D.: The distribution of molecular velocities and the mean motion in a nonuniform gas. Proc. Lond. Math. Soc. 2(1), 382–435 (1936). https://doi.org/10.1112/plms/s2-40.1.382
Alex, A., et al.: A numerical algorithm for the explicit calculation of SU(N) and SL(N, C) Clebsch–Gordan coefficients. J. Math. Phys. 52(2), 023507 (2011). https://doi.org/10.1063/1.3521562
Hanke, A., Torrilhon, M.: Representation theory based algorithm to compute Boltzmann’s bilinear collision operator in the irreducible spectral Burnett Ansatz efficiently. https://doi.org/10.5281/zenodo.5848971
Kjolstad, F., et al.: The Tensor Algebra Compiler. In: Proceeding of the ACM Programming Languages 1.OOPSLA (2017). https://doi.org/10.1145/3133901
Group representations. In: Lie Groups: An Approach through Invariants and Representations, pp. 195–239. Springer, New York, 2007. https://doi.org/10.1007/978-0-387-28929-8_8
Babovsky, H., Illner, R.: A convergence proof for Nanbu’s simulation method for the full Boltzmann equation. SIAM J. Numer. Anal. 26(1), 45–65 (1989). https://doi.org/10.1137/0726004
Müller, I., Ruggeri, T.: Extended thermodynamics of moments. In: Müller, I., Ruggeri, T. (eds.) Rational Extended Thermodynamics, pp. 197–220. Springer, New York (1998). https://doi.org/10.1007/978-1-4612-2210-1_9
Torrilhon, M., Sarna, N.: Hierarchical Boltzmann simulations and model error estimation. J. Comput. Phys. 342, 66–84 (2017). https://doi.org/10.1016/j.jcp.2017.04.041
Bünger, J., et al.: Structured derivation of moment equations and stable boundary conditions with an introduction to symmetric, trace-free tensors. Kinet. Relat. Models 16(3), 458–494 (2023). https://doi.org/10.3934/krm.2022035
Ernst, M.H.: Exact solutions of the nonlinear Boltzmann equation. J. Stat. Phys. 34(5), 1001–1017 (1984). https://doi.org/10.1007/BF01009454
Djordjic, V., Pavicolic, M., Torrilhon, M.: Consistent, explicit, and accessible Boltzmann collision operator for polyatomic gases. Phys. Rev. E 104, 025309 (2021). https://doi.org/10.1103/PhysRevE.104.025309
Pennie, C.A., Gamba, I.M.: Convergence and error estimates for the conservative spectral method for Fokker–Planck–Landau equations (2020). arXiv:2009.10352 [math.NA]
Hall, B.C.: Lie groups, lie algebras, and representations. In: Hall, B.C. (ed.) Quantum Theory for Mathematicians, pp. 333–366. Springer, New York (2013). https://doi.org/10.1007/978-1-4614-7116-5_16
Hall, B.: Matrix lie groups. In: Hall, B. (ed.) Lie Groups, Lie Algebras, and Representations: An Elementary Introduction, pp. 3–30. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-13467-3_1
Hall, B.: Lie algebras. In: Hall, B. (ed.) Lie Groups, Lie Algebras, and Representations: An Elementary Introduction, pp. 49–76. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-13467-3_3
Hall, B.C.: Basic representation theory. In: Hall, B.C. (ed.) Lie Groups, Lie Algebras, and Representations: An Elementary Introduction, pp. 77–107. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-13467-3_4
Weyl, H.: The Classical Groups: Their Invariants and Representations. Princeton University Press, Princeton (2016). https://doi.org/10.1515/9781400883905
Fulton, W., Harris, J.W.: Representation Theory: A First Course. Springer, New York (1991)
Brauer, R.: On algebras which are connected with the semisimple continuous groups. Ann. Math. 38, 857 (1937)
Fulton, W., Harris, J.: Representations of finite groups. In: Fulton, W., Harris, J. (eds.) Representation Theory: A First Course, pp. 3–11. New York, NY, Springer, New York (2004). https://doi.org/10.1007/978-1-4612-0979-9_1
Fulton, W., Harris, J.: Appendix B on multilinear algebra. In: Fulton, W., Harris, J. (eds.) Representation Theory: A First Course, pp. 471–477. Springer, New York (2004)
Hall, B.: Clebsch Gordan theory and the Wigner–Eckart theorem. In: Axler, S., Ribet, K. (eds.) Lie Groups, Lie Algebras, and Representations: An Elementary Introduction, pp. 425–434. Springer International Publishing, Cham (2015). https://doi.org/10.1007/978-3-319-13467-3
Weyl, H.: Chapter V. The orthogonal group. In: Weyl, H. (ed.) The Classical Groups: Their Invariants and Representations, pp. 137–164. Princeton University Press, Princeton (2016). https://doi.org/10.1515/9781400883905-007
Acknowledgements
This work was funded by the Deutsche Forschungsgemeinschaft (DFG, German Research Foundation)-333849990/GRK2379 (IRTG Modern Inverse Problems). The authors have no relevant financial or non-financial interests to disclose. All authors contributed to the conception and design of the presented algorithm. Material preparation, data collection and analysis were mainly performed by Andrea Hanke. The first draft of the manuscript was written by Andrea Hanke and all authors commented on previous versions of the manuscript. All authors read and approved the final manuscript. Code and other data files concerning this paper are available in the repository [20].
Funding
Open Access funding enabled and organized by Projekt DEAL. The authors have not disclosed any funding.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors have not disclosed any competing interests.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Appendix
Appendix
1.1 Rotational Covariance of the Collision Operator
Repeating the quadratic collision operator S(f) given in Eq. (3)
we see it is a function in \(\textbf{c}\). Any orthogonal coordinate transformation \(r \in \mathscr {O}_3\) acts on any function \(T: \mathbb {R}^3 \rightarrow \mathbb {R}\) by
which applied to the collision operator is
Additionally, as a physically motivated map between function spaces, the collision operator is rotationally covariant, i.e. as stated in Eq. (18) it fulfils
This means that rotating or mirroring the input distribution function f will result in a correspondingly rotated/mirrored collision operator. To see that (49) is true, we will check that
holds. Since r is a \(3\times 3\) orthogonal matrix, \(r(\textbf{c}) = r \cdot \textbf{c}\). Noting that \(r^{-1} \left( f\right) (\textbf{c}) = f(r\cdot \textbf{c}))\), we find for the left hand side
omitting the dependency on time t for the remainder of this calculation. To obtain the right hand side, we just need to substitute \(\textbf{c}\) with \(r\cdot \textbf{c}\). We will make use of the variable change
which transforms the second pair of distributions in Eq. (51) on the left hand side. Comparing to the same pair of the right hand side we obtain:
Both sides are already very similar—the only difference is the tildation of \(\mathbf {c_1}\). For the first pair in Eq. (51) we look at the velocities \(r\cdot \textbf{c}'\) and \(r\cdot \mathbf {c_1}'\) individually. From (4) we obtain
and similarly for \(r\cdot {\mathbf {c_1}}\):
We further note, that \(\textbf{g}\) and \(\textbf{n}\) are affected by the variable change:
Note how the transformation r acts on the normal vector \(\textbf{n}\): The matrix R now rotates around the axis \(r(\textbf{n})\), instead of the axis \(\textbf{n}\), while the integration domain is transformed by \(r^{-1}\). Therefore, we can equivalently integrate over \(\textbf{n} \perp r\textbf{c} - {\tilde{\textbf{c}}}_{\textbf{1}}\) and substitute all \(\textbf{n}\) by \(r^{-1}\textbf{n}\). Now we are left with an expression for \(S(r^{-1}(f))\) that has \(r(\textbf{c})\) instead of \(\textbf{c}\), and \({\tilde{\textbf{c}}}_{\textbf{1}}\) instead of \(\mathbf {c_1}\) everywhere, which is equivalent to the right hand side of (50). This shows covariance of the collision operator.
Note the core reason for this: The collision operator is a concatenation of rotationally covariant operations such as taking the norm of a vector, or integrating over all directions.
1.2 Representation Theory: An Overview
Below we describe the core aspects of representation theory that we employed in this paper. This is a use-case restricted overview of the subject and we refer the reader to lecture books, e.g. [30,31,32,33,34,35], and to the original material such as [17, 36] for further reading and full proofs of the following statements.
Definition 9.1
(Representation (Definition 4.1 from [33])) Let G be a group and F a field. A representation of G over F consists of
-
a F-vector space V,
-
a map \(\cdot : G \times V \rightarrow V, (g,v) \mapsto g\cdot v\), called the operation of the group on the vector space, that satisfies the axioms in Table 5.
We can say V is a representation of G and G operates on V.
For our purposes, we are interested in representations of the orthogonal groups, mainly \(\mathscr {O}_3\) given by orthogonal matrices \(r\in \mathbb {R}^{3\times 3}\), i.e. \(G=\mathscr {O}_3\). The field F is \(\mathbb {R}\) and the vector spaces V we use are the tensor space of \(\mathbb {R}^3\) and the space of real polynomials in three arguments \(\mathbb {R}[x,y,z]\). For both the operation of \(\mathscr {O}_3\) is given in terms of the matrix vector-product \(r\cdot v\) with \(v, v_i \in \mathbb {R}^3\):
One can generalize this to the tensor product between two vector spaces V, U each a representation of \(\mathscr {O}_3\). Then \(V\otimes U\) is also a representation of \(\mathscr {O}_3\) with the action of \(r\in \mathscr {O}_3\) on \(v\otimes u\in V\otimes U\), \(v\in V, u\in U\), defined according to chapter 1.1 in [37] as
We are concerned with \(\mathbb {R}[x,y,z]\otimes \mathbb {R}[x,y,z]\).
Definition 9.2
(Homomorphism between two representations (Definition 4.3 from [33])) Let U, T be two representations of G and \(f: U\rightarrow V\) a homomorphism, i.e. a linear map. f is called FG-linear map, intertwining map or natural map, if the two axioms in Table 5 are satisfied. The space of all FG-linear maps is written as \({{\,\textrm{Hom}\,}}_{F G}(U,V)\). If the two vector spaces are identical, i.e. \(f:V\rightarrow V\), the FG-linear map is called endomorphism and the space of such maps is denoted with \({{\,\textrm{End}\,}}_{F G}(V)\).
If f is invertible, i.e. there exists a map \(f': V \rightarrow U\) with \(f\circ f' = f' \circ f = {{\,\textrm{id}\,}}\), then f is called an isomorphism of the representations. We call two representations isomorphic if such an f exists.
For two representations to be isomorphic, their vector spaces also need to be isomorphic. If the dimensions of two vector spaces are not equal, then they cannot be isomorphic representations.
Since the collision operator describes a physical process, one of its fundamental properties is, that it does not depend on the choice of coordinates. We also note, that it is a bilinear map between polynomial spaces, \(C: P^M \times P^M \rightarrow P^M\), which can be written as a linear map between the tensor product of two polynomial spaces to a polynomial space, \(P^M \otimes P^M \rightarrow P^M\). In mathematical terms, the collision operator must be a homomorphism from \({{\,\textrm{Hom}\,}}_{\mathbb {R}\mathscr {O}_3}(P^M\otimes P^M, P^M)\), i.e. a natural map. While this already restricts the degrees of freedom when compared to general linear map \(P^M \otimes P^M \rightarrow P^M\), we can go further by using the concept of irreducible subspaces.
Definition 9.3
(Invariant subspaces (Definition 4.2 from [33])) A subspace U of any vector space V is called invariant with respect to the operation of a group G, or G-invariant, iff for all \(g\in G\) and for all \(u\in U\), the operation of g on u still is in U, \(g\cdot u\in U\) or equivalently: \(\forall g\in G: g(U)=U\).
Definition 9.4
(Irreducible subspaces (Definition 4.2 from [33])) A subspace \(U\ne 0\) of any vector space V is called irreducible with respect to the operation of a group G, or G-irreducible, iff it is invariant and there is no invariant subspace \(W\subset U\) with the properties \(W\ne U\) and \(W\ne 0\). If a subspace U is invariant and \(\dim U=1\), then it is also irreducible, however an irreducible subspace can be of dimension larger than 1.
As mentioned in [17], the space of symmetric trace-free tensors of order n, \({{\,\textrm{STF}\,}}^n(\mathbb {R}^3)\), is \(\mathscr {O}_3\)-irreducible. With the isomorphism between \({{\,\textrm{STF}\,}}^n(\mathbb {R}^3)\) and \(\mathscr {H}^n\), which is also explored in [17], \(\mathscr {H}^n\) is also irreducible.
Remark 9.5
Two irreducible subspaces \(U_1, U_2\) that are not isomorphic to each other are orthogonal to each other with respect to any scalar product of V that is compatible with the operation of the group G, i.e. is itself a natural map:
We can project from the full vector space V onto an irreducible subspace \(U_1\). Due to the universal orthogonality, such a projection is always unique up to a constant (or zero) if the irreducible subspace \(U_1\) has a multiplicity of 1, i.e. there is no other irreducible subspace of the vector space isomorphic to \(U_1\).
This universal orthogonality between non-isomorphic irreducible subspaces is a consequence of one of the fundamental lemmas in representation theory:
Lemma 9.6
(Schur’s Lemma (Theorem 4.29 from [33])) Let \(U_1\) and \(U_2\) be two finite-dimensional, irreducible subspaces of a representation. A natural map \(\phi : U_1 \rightarrow U_2\) is only non-zero, iff \(U_1\) and \(U_2\) are isomorphic to each other. All non-zero natural maps \(U_1 \rightarrow U_2\) are the same up to a constant if the field is \(\mathbb {C}\).
Remark 9.7
For real representations, the space of natural maps \(U_1\rightarrow U_2\) between two isomorphic irreducible subspaces is not necessarily one-dimensional. We shall see with Brauer’s classification of natural maps that for our case (\(F=\mathbb {R}\), \(G=\mathscr {O}_3\), V the tensor space of \(\mathbb {R}^3\) or \(\mathbb {R}[x,y,z]\)), we only find one-dimensional spaces of such natural maps.
Remark 9.8
Given a homomorphism h in \({{\,\textrm{Hom}\,}}_{FG}(U,V)\) a list of all irreducible subspaces \(U_i\) of U and \(V_j\) of V and that for each pair of isomorphic subspaces \((U_i,V_j)\) the space of natural maps \(U_i \rightarrow V_j\) is one-dimensional, we can pick non-zero isomorphism \(\phi _{ij}: U_i \rightarrow V_j\), to express h as sum over all isomorphism \(\phi _{ij}\) multiplied by free constants, which are the only degrees of freedoms for h.
This follows immediately from Schur’s Lemma. Note, that the sum does not necessarily include all combinations of (i, j), but only those where the corresponding \(U_i\) and \(V_j\) are indeed isomorphic, since all other natural maps are zero. If there are multiple instances of the same irreducible subspace up to isomorphism within U (or V), we can make the splitting unique by choosing some orthogonal decomposition with respect to the chosen scalar product for U (or V).
Applying this idea to the setup in this paper, we first define the notation for the irreducible and orthogonal subspaces characterized with the isotropic and anisotropic indices:
Now we can decompose our bilinear collision operator as the sum over natural maps between irreducible subspaces,
and using the isomorphism between \(\mathscr {H}^n_a\) and \(\mathscr {H}^n\), we can restrict the different types of maps to
We now note that symmetric tensors are isomorphic to polynomials of the same degree and same underlying vectorspace, Chapter B.2 in [38], i.e. with \(\mathbb {R}^3\) we can map symmetric tensors of degree n from the space of symmetric tensors \({{\,\textrm{Sym}\,}}^n(\mathbb {R}^3)\) to homogeneous polynomials of degree n in \(\mathbb {R}[x,y,z]\). We identify the tensor space that is isomorphic to the space of harmonic polynomials \(\mathscr {H}^n\) of degree n as the space of symmetric trace-free tensors \({{\,\textrm{STF}\,}}^n\) of same degree, as is explored in [17]. Anything that we learn from the tensorial view,
we can apply to the polynomial view. We first note, that the tensor product of two representations is itself a representation. We also note, that the codomain of the natural map is an irreducible representation of \(\mathscr {O}_3\), so using Schur’s lemma, we know that the natural maps in (66) can only be non-zero if the codomain is isomorphic to one of the irreducible subspaces of the domain. Finding the irreducible subspaces of the tensor product of two representations is not trivial in general and this decomposition is called Clebsch-Gordan theory [39]. For the case of \(\mathscr {O}_3\) and symmetric trace-free tensors over \(\mathbb {R}\), one can find the decomposition of the maps in (66) relatively easily and we show the argument only for the sake of completeness. We start by considering the types of natural maps between tensors [36]:
-
Permuting tensor factors,
-
Taking the trace between two factors,
-
Taking the tensor product with the equivalent of \(\textbf{c}^2\), \(\omega :=\sum _{i} b_i \otimes b_i\) for any orthonormal basis set \(\{b_1, b_2, b_3\}\) of \(\mathbb {R}^3\), also known as the \(\delta _{ij}\)-tensor.
Any natural map between tensor spaces can be expressed as linear combination and composition of the above maps. They can be represented graphically with Brauer diagrams [36]. This gives us a search space for the isomorphism \({{\,\textrm{STF}\,}}^{\hat{n}} \otimes {{\,\textrm{STF}\,}}^{\tilde{n}} \rightarrow {{\,\textrm{STF}\,}}^n\). We first note, that the result must be a tracefree tensor, so we use the detracer \(\mathscr {D}\) from [17]. \(\mathscr {D}\) is an explicitly given projection from a symmetric tensor onto the symmetric tracefree tensor space of same order. As its input is a symmetric tensor, we need the symmetrizer (Part B.2 in [38])
which gives us the overall projector \(\mathscr {D}\circ q\), which is applied last. From Remark 9.5 and because the multiplicity of \({{\,\textrm{STF}\,}}^n\) in \({{\,\textrm{Sym}\,}}^n\) is one, we know that this map is unique up to a constant. What remains to be shown is for any natural linear map \(\phi \) that we apply before this projector such that the overall map \(\mathscr {D}\circ q \circ \phi \) is \({{\,\textrm{STF}\,}}^{\hat{n}} \otimes {{\,\textrm{STF}\,}}^{\tilde{n}} \rightarrow {{\,\textrm{STF}\,}}^n\), the overall maps at most differ by a constant factor and thereby fulfilling the promise in Remark 9.7. In other words: We want to show that
is one-dimensional or the space of maps to zero.
Next, we note that the overall tensorial degree of the domain is \(\hat{n}+\tilde{n}\), and we would like to show the restriction \(n\le \hat{n}+\tilde{n}\). To do so, we find all natural maps \({{\,\textrm{STF}\,}}^n \rightarrow {(\mathbb {R}^3)}^{\otimes (\hat{n}+\tilde{n})}\) and show that they must all be zero, leaving us with the conclusion that a representation isomorphic to \({{\,\textrm{STF}\,}}^n\) cannot be contained in \({(\mathbb {R}^3)}^{\otimes (\hat{n}+\tilde{n})}\). Any natural map \({{\,\textrm{STF}\,}}^n \rightarrow {(\mathbb {R}^3)}^{\otimes (\hat{n}+\tilde{n})}\) must reduce the tensorial degree n to \(\hat{n}+\tilde{n}\), leaving the trace as the only candidate, but since the tensors in \({{\,\textrm{STF}\,}}^n\) are trace-free by definition, the result is zero. First taking the tensor product, i.e. multiplying, with any number of \(\omega \) also does not yield a different result, since in the end more traces need to be taken than the number of \(\omega \) that have been multiplied, inadvertently leading to taking at least one trace of the tensor in \({{\,\textrm{STF}\,}}^n\). Therefore, for \(n> \hat{n}+\tilde{n}\) there does not exist a subspace in \({(\mathbb {R}^3)}^{\otimes (\hat{n}+\tilde{n})}\) isomorphic to \({{\,\textrm{STF}\,}}^n\), leading us to the desired restriction.
We further note, that the maps listed above all change the degree of tensor product by an even number, so \({\tilde{n}} + \hat{n} - n \in 2\mathbb {N}_0\). It is clear that the traces can only be taken between one factor of \({{\,\textrm{STF}\,}}^{\hat{n}}\) and one factor of \({{\,\textrm{STF}\,}}^{\tilde{n}}\) each, as opposed to a trace of two factors in \({{\,\textrm{STF}\,}}^{\hat{n}}\), which would map to zero by definition of \({{\,\textrm{STF}\,}}^{\hat{n}}\). This gives the third condition on the anisotropic indices from (32), \({\tilde{n}} + \hat{n} - n \le 2 \min ({\tilde{n}}, \hat{n})\). Finally, we note that the codomain is also tracefree, so if somewhere in the composition the tensor product with \(\omega \) is taken, an additional trace needs to be taken as well to achieve tracefreeness, otherwise the detracer would project onto zero due to the orthogonality between tracefree tensors and tensors with a trace (Part B, section 6 in [40]). Overall, the space of homomorphisms \({{\,\textrm{STF}\,}}^{\hat{n}} \otimes {{\,\textrm{STF}\,}}^{\tilde{n}} \rightarrow {{\,\textrm{STF}\,}}^n\) is one-dimensional, so each map is unique up to a constant.
Following the isomorphism back to the polynomial spaces, we can apply the results one-to-one, leading to the unique up to a constant natural map given by the coupling tensor. Looking at (64):
and picking one non-zero instance for each of the maps that satisfy (32):
we apply Remark 9.8 to write any natural map of the form (64) as
Noting that due to the isomorphism between the irreducible subspaces, all \(\tilde{Q}^{n\hat{n}\tilde{n}}_{abc}\) with same combination of anisotropic indices work the same, we arrive at the decomposition of the collision tensor from Eq. (33).
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Hanke, A., Torrilhon, M. Representation Theory Based Algorithm to Compute Boltzmann’s Bilinear Collision Operator in the Irreducible Spectral Burnett Ansatz Efficiently. J Sci Comput 95, 78 (2023). https://doi.org/10.1007/s10915-023-02168-8
Received:
Revised:
Accepted:
Published:
DOI: https://doi.org/10.1007/s10915-023-02168-8