1 Introduction

A central configuration, denoted as CC, is an initial configuration \((q_1, \ldots , q_n)\) in the Newtonian n-body problem, such that if the particles were all released with zero velocity, they would collapse toward the center of mass c at the same time. In the planar case, CCs are initial positions for periodic solutions which preserve the shape of the configuration. CCs also play an important role in the study of the topology of integral manifolds in the n-body problem (see Moeckel (2014) and the references given there).

1.1 State of the art

The investigation of central configurations for equal masses is a subcase of more general problem of central configurations with arbitrary positive masses. The general conjecture of finiteness of central configurations (relative equilibria) in the n-body problem is stated in Wintner (1941) and appears as the sixth problem of Smale’s eighteen problems for the twenty first century Smale (1998). We refer the reader to our paper Moczurad and Zgliczyński (2019) for the description of the state of the art of the planar problem.

For the spatial 5-body problem Moeckel in Moeckel (2001) established the generic finiteness of Dziobek’s CCs (CCs which are non-planar). A computer-assisted work by Hampton and Jensen (2011) strengthens this result by giving an explicit list of conditions for exceptional values of masses.

The spatial 5-body problem with equal masses was studied by Kotsireas and his coworkers (Kotsireas 2000; Faugére and Kotsireas 1999; Kotsireas and Lazard 2002 and references given there). They have shown that possible CCs with some reflectional symmetry can be divided into five classes. For four of them, they proved (computer assisted proof) the existence of a CC and stated a hypothesis that no CC exists in the remaining one. This hypothesis was later proven in Alvarez et al. (2008) through another computer assisted proof. In our paper, we confirm that there are exactly four non-equivalent non-planar CCs having a reflection symmetry for \(n=5\) bodies with equal masses; moreover, we prove that there are no non-symmetric non-planar CCs for \(n=5\).

Next relevant work on spatial CC for \(n=5\) is Lee and Santoprete (2009), where a complete classification of the isolated CCs of the 5-body problem was given. The approach has a numerical component, and hence, it cannot be claimed fully rigorous. (On the other hand, the existence of identified isolated CC has been proven using the Krawczyk’s operator, i.e., a tool from interval arithmetic we also use.) The proof also does not exclude the possibility that a higher dimensional set of solutions exists.

The above-mentioned works study the polynomial equations derived from the equations for CC using the (real or complex) algebraic geometry tools. In contrast, we take a different approach: We use standard interval arithmetic tools, and hence, in principle we can treat also other potentials which cannot be reduced to polynomial equations.

1.2 Main result

In the paper, we show that there exists only a finite number of different CCs (up to euclidean symmetries, scaling and permutations of bodies), for \(n=5,6\) in the spatial n-body Newtonian problem with equal masses. There are four non-planar CCs for \(n = 5\) and nine for \(n = 6\); they are listed in Sect. 6. Moreover, for each of these CCs, we give a list of all euclidean symmetries. In particular, we show that each of these central configurations has some reflectional symmetry.

Hence, any CC can be obtained from one of the above CCs by a suitable composition of translation, scaling, rotation, reflection, and permutation of bodies.

Theorem 1

In the spatial n-body Newtonian problem with equal masses, there exists exactly four for \(n=5\) and nine for \(n=6\) types of non-planar CCs. Each of them has a reflectional symmetry.

The method used in the present paper is a straightforward extension of our work Moczurad and Zgliczyński (2019) on the planar case to the spatial one. This is basically a brute force approach using standard interval arithmetic tools.

1.3 Structure of the paper

The paper is organized as follows. In Sects. 2 and 3, we recall notations, definitions and fundamental equations for central configurations. In Sect. 4 we explain the method of finding and recognizing symmetries of CCs—the idea is the same as in Moczurad and Zgliczyński (2019), however algebraic details are different. We present a subject more formally, since symmetries in 3D are more complicated than in the planar case. In Sect. 5, we give some details of the computer assisted proof focusing on the reduction of the configuration space to be searched. In Sect. 6, we give a full listing of central configurations for \(n=5\) and 6. In Sect. 7, we count the number of non-equivalent CCs for the case of masses close to the equal mass case, where, informally speaking, we call two CCs equivalent if they have the same geometrical shape. In Sect. 8, we report on our non-rigorous computations concerning the instability of all planar CCs in the equal mass case.

2 Equations for central configurations

This section is almost identical to some parts of Sect. 2 in Moczurad and Zgliczyński (2019). It is included here just to make the paper reasonably self-contained.

By |z| we denote the Euclidean norm of z, i.e., \(|z|=\left( \sum _{i=1}^d z_i^2\right) ^{1/2}\), where \(z\in {\mathbb {R}}^d\). By (x|y), we denote the standard scalar product, i.e., \((x|y)=\sum _{i=1}^d x_iy_i\), where \(x,y \in {\mathbb {R}}^d\). We often use \(z^2:=(z|z)\). Let \(q_i \in {\mathbb {R}}^d\), \(i=1,\dots ,n\) and \(d\geqslant 1\) (the physically interesting cases are \(d=1,2,3\)), where \(q_i\) is a position of ith body with mass \(m_i \in {\mathbb {R}}_+\). Let us set

$$\begin{aligned} M=\sum _{i=1}^n m_i. \end{aligned}$$
(1)

Central configurations are solutions of the following system of equations (see Moeckel (2014)):

$$\begin{aligned} \lambda (q_i-c) = \sum _{\begin{array}{c} j=1\\ j\ne i \end{array}}^n \frac{m_j}{r_{ij}^3}(q_i - q_j)=:\frac{1}{m_i}f_i(q_1,\dots ,q_n), \quad i=1,\dots ,n \end{aligned}$$
(2)

where \(\lambda \in {\mathbb {R}}\) is a constant, \(c=\left( \sum _{i=1}^n m_i q_i\right) /M\) is center of mass, \(r_{ij}=r_{ji}=|q_i - q_j|\) is the Euclidean distance between i-th and jth bodies and \((-f_i)\) is the force which acts on ith body resulting from the gravitational pull of other bodies. The system of Eq. (2) has the same symmetries as the n-body problem. It is invariant with respect to group of isometries of \({\mathbb {R}}^d\) and the scaling of variables.

The system (2) has dn equations and \(dn + 1\) unknowns: \(q_i \in {\mathbb {R}}^d\) for \(i = 1, \ldots , n\) and \(\lambda \in {\mathbb {R}}_+\). The system has a \({\mathcal {O}}(d)\) and scaling symmetry (with respect to \(q_i\)’s and \(m_i\)’s), where \({\mathcal {O}}(d)\) is an orthogonal group in dimension d. If we demand that \(c=0\) (which is obtained by a suitable translation) and \(\lambda =1\) (which can be obtained by scaling \(q_i\)’s or \(m_i\)’s), we obtain the equations (compare Moeckel (2014), Moeckel 2014; Albouy and Kaloshin 2012)

$$\begin{aligned} q_i= \sum _{j,j\ne i} \frac{m_j}{r_{ij}^3}(q_i - q_j)=:\frac{1}{m_i}f_i(q_1,\dots ,q_n), \quad i=1,\dots ,n. \end{aligned}$$
(3)

It is easy to see that if (3) is satisfied, then \(c=0\) and (2) also holds for \(\lambda =1\). A point \(q = (q_1,\dots ,q_n) \in \left( {\mathbb {R}}^d\right) ^n\) is called a configuration. If q satisfies (3), then it is called a normalized central configuration (abbreviated as CC). For the future use, we introduce the function \(F:\Pi _{i=1}^n{\mathbb {R}}^{d } \rightarrow \Pi _{i=1}^n{\mathbb {R}}^{d }\) given by

$$\begin{aligned} F_i(q_1,\dots ,q_n) = q_i - \sum _{j,j\ne i} \frac{m_j}{r_{ij}^3}(q_i - q_j), \quad i=1,\dots ,n. \end{aligned}$$
(4)

Then the system (3) becomes

$$\begin{aligned} F(q_1,\dots ,q_n)=0. \end{aligned}$$
(5)

It is well know that for any \((q_1,q_2,q_3,\dots ,q_n)\in ({\mathbb {R}}^d)^n\) holds

$$\begin{aligned} \sum _{i=1}^n f_i= & {} 0, \end{aligned}$$
(6)
$$\begin{aligned} \sum _{i=1}^n f_i \wedge q_i= & {} 0, \end{aligned}$$
(7)

where \(v \wedge w\) is the exterior product of vectors, the result being an element of exterior algebra. If \(d=2,3\) it can be interpreted as the vector product of v and w in dimension 3. The identities (6) and (7) are easy consequences of the third Newton’s law (the action equals reaction) and the requirement that the mutual forces between bodies are in direction of the other body.

Consider system (3). After multiplication of ith equation by \(m_i\) and addition of all equations using (6), we obtain (or rather recover) the center of mass equation

$$\begin{aligned} \left( \sum _{i=1}^n m_i\right) c=\sum _i m_i q_i = 0. \end{aligned}$$
(8)

We can take the equations for nth body and replace it with (8) to obtain an equivalent system.

$$\begin{aligned} q_i= & {} \sum _{j,j\ne i} \frac{m_j}{r_{ij}^3}(q_i - q_j)=:\frac{1}{m_i}f_i(q_1,\dots ,q_n), \quad i=1,\dots ,n-1, \end{aligned}$$
(9)
$$\begin{aligned} q_n= & {} -\frac{1}{m_n}\sum _{i=1}^{n-1} m_i q_i. \end{aligned}$$
(10)

In Sect. 3, we use (7) to define a reduced system of equations for CCs which does not have the degeneracies present in system (3).

3 The reduced system of equations for CC

The goal of this section is to derive a set of equations (the reduced system of equations), which gives all CCs, but the system will no longer have \({\mathcal {SO}}(3)\)-symmetry. This section is an extension to \(d=3\) of the results of Section 5 in Moczurad and Zgliczyński (2019), where the planar case \(d=2\) has been treated.

3.1 Non-degenerate solutions of full and reduced systems of equations

Following Moeckel (2014) we state the following definition.

Definition 1

We say that a normalized central configuration \(q=(q_1,\dots ,q_n)\) is non-degenerate if the rank of DF(q) is equal to \(dn-\dim {\mathcal {SO}}(d)\). Otherwise the configuration is called degenerate.

The idea of the above notion of degeneracy is to allow only for the degeneracy related to the rotational symmetry of the problem, because by setting \(\lambda =1\) in (2) and keeping the masses fixed we removed the scaling symmetry.

We write the system (910) obtained from (5) after removing the nth body using the center of mass equation (condition (8)) as

$$\begin{aligned} F_{\mathrm {red}}(q_1,\dots ,q_{n-1})=0, \end{aligned}$$
(11)

where \(F_{\mathrm {red}}: \Pi _{i=1}^{n-1}{\mathbb {R}}^{d} \rightarrow \Pi _{i=1}^{n-1}{\mathbb {R}}^{d} \). Then it is easy to see that \(q=(q_1,\dots ,q_{n-1},q_n)\) is a non-degenerate central configuration iff the rank of \(DF_{\mathrm {red}}(q_1,\dots ,q_{n-1})\) is \(d(n-1)-\dim {\mathcal {SO}}(d)\).

3.2 The reduced system \({{\mathcal {RS}}}\)

Let \(d=3\). The fact that the system of Eq. (3) is degenerate make this system not amenable for the use of standard interval arithmetic methods (see, for example, the Krawczyk operator) to rigorously count all possible solutions. We need to remove the \({\mathcal {SO}}(3)\)-symmetry and then hope that all solutions will be non-degenerate. In this section, we present such reduction.

Let us fix \(k_1,k_2 \in \{1,\dots ,n-1\}\), \(k_1 \ne k_2\) and consider the following set of equations

$$\begin{aligned} q_i= & {} \frac{1}{m_i}f_i(q_1,\dots ,q_n(q_1,\dots ,q_{n-1})), \quad i\in \{1,\dots ,n-1\}, i \ne k_1,k_2 \end{aligned}$$
(12)
$$\begin{aligned} x_{k_1}= & {} \frac{1}{m_{k_1}}f_{k_1,x}(q_1,\dots ,q_n(q_1,\dots ,q_{n-1})), \end{aligned}$$
(13)
$$\begin{aligned} x_{k_2}= & {} \frac{1}{m_{k_2}}f_{k_2,x}(q_1,\dots ,q_n(q_1,\dots ,q_{n-1})), \end{aligned}$$
(14)
$$\begin{aligned} y_{k_2}= & {} \frac{1}{m_{k_2}}f_{k_2,y}(q_1,\dots ,q_n(q_1,\dots ,q_{n-1})), \end{aligned}$$
(15)

where

$$\begin{aligned} q_n(q_1,\dots ,q_{n-1})= & {} -\frac{1}{m_n}\sum _{i=1}^{n-1} m_i q_i, \end{aligned}$$
(16)

where \(f_i = (f_{i,x}, f_{i, y},f_{i,z})\). In the sequel, we use the abbreviation \({{\mathcal {RS}}}\) to denote the reduced system (1215). Observe that \({{\mathcal {RS}}}\) coincides with the system (910) with the equations for \(y_{k_1},z_{k_1},z_{k_2}\) dropped.

Observe also that \({{\mathcal {RS}}}\) no longer has O(3) as a symmetry group. But still, it is symmetric with respect to the reflections against the coordinate planes.

The next theorem addresses the question: whether from \({{\mathcal {RS}}}\) we obtain the solution of (3)?

Theorem 2

Case 1 :

If \(q = (q_1,\dots ,q_n)\) is a solution of \({{\mathcal {RS}}}\) and the following conditions are satisfied

  1. (A1)

    \(x_{k_1} \ne x_n\),

  2. (A2)

    the vectors \((x_{k_1}-x_n,y_{k_1}-y_n)\) and \((x_{k_2}-x_n,y_{k_2}-y_n)\) are linearly independent, then it is a normalized central configuration, i.e., it satisfies (3).

Case 2 :

If q is a solution of \({{\mathcal {RS}}}\) such that \(z_i=0\) for \(i=1,\dots ,n\) and condition (A1) is satisfied, then q is a normalized central configuration, i.e., it satisfies (3).

Proof

First we show Case 1. For any configuration q,, we set

$$\begin{aligned} R_i(q_1,\dots ,q_n)=m_iq_i - f_i(q_1,\dots ,q_n), \quad i=1,\dots ,n \end{aligned}$$
(17)

and for any \((q_1,\dots ,q_{n-1}) \in ({\mathbb {R}}^3)^{n-1}\), we define

$$\begin{aligned} {\tilde{R}}_i(q_1,\dots ,q_{n-1})=R_i(q_1,\dots ,q_{n-1},q_n(q_1,\dots ,q_{n-1})), \quad i=1,\dots ,n. \end{aligned}$$
(18)

We use the notation \({\tilde{R}}_i=({\tilde{R}}_{i,x},{\tilde{R}}_{i,y},{\tilde{R}}_{i,z})\). Observe that for any \((q_1,\dots ,q_{n-1}) \in ({\mathbb {R}}^3)^{n-1}\) holds

$$\begin{aligned} {\tilde{R}}_n(q_1,\dots ,q_{n-1})=-\sum _{i=1}^{n-1}{\tilde{R}}_n(q_1,\dots ,q_{n-1}). \end{aligned}$$
(19)

Indeed, from (6) and (16)–(18), we have

$$\begin{aligned} {\tilde{R}}_n(q_1,\dots ,q_{n-1})= & {} R_n(q_1,\dots ,q_{n-1},q_n(q_1,\dots ,q_{n-1})) \\= & {} m_n q_n(q_1,\dots ,q_{n-1})-f_n(q_1,\dots ,q_{n-1},q_n(q_1,\dots ,q_{n-1})) \\= & {} -\sum _{i=1}^{n-1} m_i q_i + \sum _{i=1}^{n-1}f_i(q_1,\dots ,q_{n-1},q_n(q_1,\dots ,q_{n-1})) \\= & {} - \sum _{i=1}^{n-1}{\tilde{R}}_n(q_1,\dots ,q_{n-1}). \end{aligned}$$

Observe that from (7) it follows that for any configuration \((q_1,\dots ,q_n)\) holds

$$\begin{aligned} \sum _{i=1}^n q_i \wedge R_i(q_1,\dots ,q_n)=0. \end{aligned}$$
(20)

In particular, for \(q_n=q_n(q_1,\dots ,q_{n-1})\), we obtain from (19)

$$\begin{aligned} 0= & {} \sum _{i=1}^{n-1}q_i \wedge {\tilde{R}}_i \left( q_1,\dots ,q_{n-1}\right) + q_n(q_1,\dots ,q_{n-1}) \wedge {\tilde{R}}_n \left( q_1,\dots ,q_{n-1}\right) \nonumber \\= & {} \sum _{i=1}^{n-1}(q_i-q_n) \wedge {\tilde{R}}_i(q_1,\dots ,q_{n-1}). \end{aligned}$$
(21)

From now on, we assume that \(q=(q_1,\dots ,q_{n-1})\) is a solution of \({{\mathcal {RS}}}\) and \(q_n=q_n(q_1,\dots ,q_{n-1})\). Without any loss of the generality, we can assume that \(k_1=n-1\) and \(k_2=n-2\). We need to show that \({\tilde{R}}_{n-1,y}(q)={\tilde{R}}_{n-1,z}(q)={\tilde{R}}_{n-2,z}(q)=0\). From (21), we have

$$\begin{aligned} 0= & {} \sum _{i=1}^{n-1}(q_i-q_n) \wedge {\tilde{R}}_i(q)\\= & {} (q_{n-2}-q_n)\wedge {\tilde{R}}_{n-2}(q) + (q_{n-1}-q_n)\wedge {\tilde{R}}_{n-1}(q)\\= & {} (q_{n-2}-q_n)\wedge (0,0,{\tilde{R}}_{n-2,z}(q)) + (q_{n-1}-q_n)\wedge (0,{\tilde{R}}_{n-1,y}(q),{\tilde{R}}_{n-1,z}(q))\\= & {} \begin{bmatrix} (y_{n-2}-y_n){\tilde{R}}_{n-2,z}(q) + (y_{n-1}-y_n){\tilde{R}}_{n-1,z}(q) - (z_{n-1}-z_n){\tilde{R}}_{n-1,y}(q) \\ -(x_{n-2}-x_n){\tilde{R}}_{n-2,z}(q) - (x_{n-1}-x_n){\tilde{R}}_{n-1,z}(q) \\ (x_{n-1}-x_n){\tilde{R}}_{n-1,y}(q)\end{bmatrix} \\= & {} \begin{bmatrix} (y_{n-2}-y_n) &{} - (z_{n-1}-z_n) &{} (y_{n-1}-y_n) \\ -(x_{n-2}-x_n) &{} 0 &{} -(x_{n-1}-x_n) \\ 0 &{} (x_{n-1}-x_n) &{} 0 \end{bmatrix} \cdot \begin{bmatrix} {\tilde{R}}_{n-2,z}(q) \\ {\tilde{R}}_{n-1,y}(q) \\ {\tilde{R}}_{n-1,z}(q) \end{bmatrix}. \end{aligned}$$

This is a homogenous linear system. If the determinant of its matrix is nonzero, then it has only the zero solution. It is easy to see that this implied by the following two conditions

$$\begin{aligned} x_{n-1}-x_n \ne 0 \\ \det \left[ \begin{array}{cc} (y_{n-2}-y_n), &{} (y_{n-1}-y_n) \\ -(x_{n-2}-x_n), &{} -(x_{n-1}-x_n) \\ \end{array} \right] \ne 0 \end{aligned}$$

The second condition means that vectors \((x_{n-2}-x_n,y_{n-2}-y_n)\) and \((x_{n-1}-x_n,y_{n-1}-y_n)\) are linearly independent.

Now we treat Case 2, where \(z_i=0\), for \(i=1,\dots ,n\). Then we obtain \({\tilde{R}}_{n-1,z}(q)={\tilde{R}}_{n-2,z}(q)=0\), and we just need to show that \({\tilde{R}}_{n-1,y}(q)=0\). After substitution of this information in the above formulas, we obtain

$$\begin{aligned} 0= & {} \sum _{i=1}^{n-1}(q_i-q_n) \wedge {\tilde{R}}_i(q) = \begin{bmatrix} 0 \\ 0 \\ (x_{n-1}-x_n) {\tilde{R}}_{n-1,y}(q), \end{bmatrix} \end{aligned}$$

and our assertion follows immediately. \(\square \)

Observe that condition (A2) is never satisfied for collinear solutions and also might not be satisfied for some planar solutions containing three collinear bodies—such solutions exist for \(n=5\) and more, see (Moczurad and Zgliczyński 2019, Sec. A.2). This is why we included the second assertion in Theorem 2.

Other issue is how to know that a particular solution of the reduced system (1215) is contained in the plane \(\{z=0\}\), while our information about the solution is that it is a unique solution in some interval set (box), which is not contained in this plane. This issue is addressed below.

3.3 Collinearity and coplanarity tests

To define the reduced system we select \(k_1=n-1\) and \(k_2=1\), which implies that \(q_{n-1}=(x_{n-1},0,0)\) and \(q_1=(x_1,y_1,0)\). Let us denote by \(R_y(x,y,z)=(x,-y,z)\) and \(R_z(x,y,z)=(x,y,-z)\) the reflections with respect to OXZ and OXY planes, respectively. Let q be a configuration. For any map \(R:{\mathbb {R}}^d\rightarrow {\mathbb {R}}^d\), we set

$$\begin{aligned} R(q)=(Rq_1,\dots ,R q_n). \end{aligned}$$

The proposed tests are based on the effective procedure to check the local uniqueness for the reduced system which, in our case, is the application of the Krawczyk operator Krawczyk (1969) (see also (Moczurad and Zgliczyński 2019, Sec. 6.3)).

3.3.1 Coplanarity test

Observe that if q is a coplanar solution of the reduced system and \(q_1\), 0 and \(q_{n-1}\) are not collinear, then q must be contained in the plane \(\{z=0\}\).

Theorem 3

Assume that C is a box containing a unique solution q of \({{\mathcal {RS}}}\) and that the set \(C \cup R_z C\) contains a unique solution of that system. Then \(q=R_z q\), i.e., q is contained in the plane \(\{z=0\}\).

Proof

Observe that \({{\mathcal {RS}}}\) is symmetric with respect to \(R_z\), and hence if q is a unique solution of \({{\mathcal {RS}}}\) in C, then \(R_z q\) is a unique solution of \({{\mathcal {RS}}}\) in \(R_z C\). From the uniqueness in \(C \cup R_z C\), it follows that \(q=R_z q\). \(\square \)

3.3.2 Collinearity test

Observe that if q is a collinear solution of \({{\mathcal {RS}}}\) , then it is contained in the OX-axis. Indeed, \(q_{n-1}\) and \(c=0\) (the center of mass) belongs to the line containing CC.

Theorem 4

Assume that C is a box containing a unique solution of \({{\mathcal {RS}}}\) and that each of the sets \(C \cup R_y C\) and \(C \cup R_z C\) contains a unique solution of \({{\mathcal {RS}}}\) , then the unique solution in C is collinear.

Proof

From the coplanarity tests, it follows that \(y_i=z_i=0\), for all \(i=1,\dots ,n\). Hence the solution is contained in the OX-axis. \(\square \)

4 Symmetries

The goal of this section is to describe a method which allows to find all orthogonal symmetries of spatial configurations. Conceptually, this is the same as in Moczurad and Zgliczyński (2019). However, the task of finding symmetries in 3D is a bit more involved, and thus we are more formal this time and we devote a full section to it. In this section, we index bodies from 0 to \(n-1\) to be in the agreement with the program. Let us stress that in Sect. 3.3 we describe an effective test, which tells us whether a solution of \({{\mathcal {RS}}}\) is coplanar (which means that it is contained in the plane \(\{z=0\}\)). As in Sect. 3.3 (modulo indexing bodies starting from 0), we assume that \({{\mathcal {RS}}}\) is defined by \(k_1=n-2\) and \(k_2=0\).

Definition 2

Let \(\sigma :\{0,1,\dots ,n-1\} \rightarrow \{0,1,\dots ,n-1\}\) be a permutation (i.e., \(\sigma \in S_n\)) and \(R \in {\mathcal {O}}(3)\). Then \((R,\sigma )\) is an (orthogonal) symmetry of a configuration \((q_0,\dots ,q_{n-1})\) iff

$$\begin{aligned} q_{\sigma (i)}=R q_i, \quad i=0,1,\dots ,n-1. \end{aligned}$$
(22)

Definition 3

For \(\sigma \in S_n\) and \(R \in {\mathcal {O}}(3)\), we define a map

$$\begin{aligned} (R,\sigma ): ({\mathbb {R}}^3)^n \rightarrow ({\mathbb {R}}^3)^n , \end{aligned}$$

by

$$\begin{aligned} ((R,\sigma )q)_{\sigma (i)}=Rq_i, \quad i=0,1,\dots ,n-1. \end{aligned}$$
(23)

Obviously, q has symmetry \((R,\sigma )\) iff \((R,\sigma )q=q\).

In \({{\mathcal {RS}}}\) the special role is given to the OX-axis (which contains \(q_{n-2}\)) and the plane \(\{z=0\}\) (containing \(q_0\) and \(q_{n-2}\)). This should be taken into account when looking for orthogonal matrix R which is a symmetry of a given CC. It should act so that from one solution of \({{\mathcal {RS}}}\) we should obtain another solution of \({{\mathcal {RS}}}\) .

Let us stress that in the paper a normalized central configuration q is the unique solution of \({{\mathcal {RS}}}\) in an interval set Z and is represented by this interval set. The basic idea is first to find a good candidate for R and then look for possible \(\sigma \). Once we have such candidate \((R,\sigma )\), we take \(Z'=\text{ intervalHull }(Z,(R,\sigma )(Z))\) and show, using the Krawczyk method, the uniqueness of CC in \(Z'\). From this, it follows that q and its symmetric image \((R,\sigma )q\), both being the solutions of the reduced system in \(Z'\), coincide.

Below we describe a procedure, which allows to find all symmetries of a given CC. We need to find both \(R \in {\mathcal {O}}(3)\) and \(\sigma \in S_n\).

4.1 Finding candidates for \((R, \sigma )\)

CC is given as interval sets \(q_i \subset {\mathbb {R}}^3\), \(i=0,1,\dots ,n-1\). We assume that

$$\begin{aligned} x_{n-2}>0, \quad y_0 >0. \end{aligned}$$
(24)

Moreover, we assume that \(y_0 \geqslant |y_i|\), for \(i=0,1,\dots ,n-1\). Observe that this condition implies that CC is not collinear. In (4.1.1)–(4.1.4), we describe each step of construction of \((R,\sigma )\).

4.1.1 Initialization of \(\sigma \)

At the beginning, \(\sigma (i)\) is undefined for all i.

4.1.2 Finding candidates for symmetric images of \(q_{n-2}\) and \(q_0\)

Since we need to check all possibilities, we repeat the below procedure for each \(i\in \{0, \ldots , n-1\}\):

  • if \(|q_i|\cap |q_{n-2}|\ne \emptyset \), then set \(\sigma (n-2) = i\);

  • if \(0 \in |q_i|\), then we would exit with failure; however, this never happens in our program;

  • let us define \(e_x = q_i/|q_i|\) and \(t_j = |q_j - (q_j|e_x)e_x|\). We look for \(j\ne i\) such that \(t_j \cap y_0 \ne \emptyset \). If there is no such j, we abandon the construction and continue for the next i. Otherwise, we set \(\sigma (0)=j\).

In this way, we identify all possible images of \((n-2)\)-th and 0-th bodies by orthogonal symmetries. Observe that if \(\sigma (n-2)=n-2\) and \(\sigma (0)=0\), then R is an identity on the plane \(z=0\). i.e., R is either an identity in \({\mathbb {R}}^3\) or the reflection with respect to \(z=0\) plane.

4.1.3 Constructing \(R_+\) and \(R_-\): the candidates for the symmetries

At this moment, we have a candidate for the image of the plane \(\{z=0\}\). This is a plane containing 0, \(q_{\sigma (n-2)}\) and \(q_{\sigma (0)}\). If \(0 \in |{\tilde{e}}_y|\), we abandon the construction and return failure (this never happens in the program, because we know that the solution is not collinear and bounds on \(q_i\) are tight). Let us define \({\tilde{e}}_y=q_{\sigma (0)} - (q_{\sigma (0)}|e_x)e_x\) and \(e_y={\tilde{e}}_y/ |{\tilde{e}}_y|\). We define \(e_z=e_x \times e_y\). Now we have two possibilities for R, denoted by \(R_\pm \). We define them on the standard basis \(e_1,e_2,e_3\) as follows

$$\begin{aligned} R_\pm (e_1)=e_x, \quad R_\pm (e_2)=e_y, \quad R_\pm (e_3)=\pm e_z. \end{aligned}$$

Observe that \(\det R_+=1\) and \(\det R_-=-1\).

4.1.4 Construction of \(\sigma \)

Let us fix \(R=R_+\) or \(R=R_-\). We extend the definition of \(\sigma \) by demanding that for each i there exists unique \(j=\sigma (i)\), such that

$$\begin{aligned} q_{j} \cap R q_i \ne \emptyset . \end{aligned}$$

4.2 Geometric description of \((R,\sigma )\)

Once we know a symmetry \((R,\sigma )\) of CC q, we want to recognize its geometric features, for example, the angle of rotation etc.

We have two possibilities for R: It is either a rotation around some axis or is an improper rotation, which is composition of rotation (we allow also for identity) with reflections and are characterized by orthogonal matrices with determinant \(-1\). We should stress that even though R is given as an interval matrix, we can give exact value of the rotation angle, since we have discrete set of points \(q_i\) which are permuted by \(\sigma \).

4.2.1 Rotations

The eigenvalues of R are \(\{1, e^{\pm i \varphi }\}\). The eigenvector corresponding to 1 is the axis of rotation and in the perpendicular plane we have a rotation by the angle \(\varphi \).

In order to determine \(\varphi \), we decompose \(\sigma \) into cycles. The cycles of length 1 consist of points on the rotation axis. All other cycles should be of the same length k and the rotation angle is \(\varphi =2\pi /k\). Observe that there must be \(k>0\) because we assumed that configuration is non collinear.

4.2.2 Improper rotations

The eigenvalues of R are \(\{-1, e^{\pm i \varphi }\}\). We have three possibilities:

  1. 1.

    the eigenvalue \(-1\) has multiplicity three. In such situation, \(R=-\text{ Id } \,\) and the decomposition of \(\sigma \) into cycles have the following properties: there exists at most one cycle of length 1 (this must be \(q_i=0\)) and all other cycles are of length two,

  2. 2.

    the eigenvalue \(-1\) has multiplicity one and 1 has the multiplicity two. This is a reflection with respect to some plane. Thus the decomposition of \(\sigma \) into cycles has the following properties:

    • there might be several cycles of length 1, these are points on the reflection plane,

    • all other cycles are of length 2, if the configuration is non-coplanar then at least one such cycle should appear,

  3. 3.

    all eigenvalues have multiplicity one. The eigenvector corresponding to \(-1\) is the axis of rotation and in the perpendicular plane we have a rotation by the angle \(\varphi \). In this situation, the decomposition of \(\sigma \) into cycles has the following properties:

    • there can be at most one cycle of length 1, this must a point at the origin.

    • there can be cycles of length 2, located on the ”rotation” axis,

    • all other cycles should be of the length k or 2k, where \(\varphi =2\pi /k\); observe that k must be greater than 2 (\(k=2\) is taken care of in the case of \(-1\) having multiplicity three).

From the above consideration, it is clear that looking on \(\sigma \) alone we might not be able to distinguish the cases 1 and 2, but this can be done easily by additionally estimating the eigenvalues of R. In the third case we may need to compute eigenvalues to decide on the value of k.

To determine the angle \(\varphi \) we use the fact that if A is a linear operator represented by a square matrix and \(\lambda _1, \ldots , \lambda _n\) are the eigenvalues of A, then

$$\begin{aligned} \text{ tr } \,(A) = \sum _{i=1}^n \lambda _i = \sum _{i=1}^n a_{ii}. \end{aligned}$$

Since in the orientation reversing case the eigenvalues of R are \(\{-1, e^{\pm i \varphi }\}\), we obtain

$$\begin{aligned} \text{ tr } \,(R)= & {} -1 + e^{i\varphi } + e^{-i\varphi }\\= & {} -1 + 2\cos \varphi , \end{aligned}$$

hence

$$\begin{aligned} \cos \varphi = \frac{\text{ tr } \,(R) + 1}{2}. \end{aligned}$$
(25)

Moreover, if \(R = -\text{ Id } \,\), then \(\text{ tr } \,(R) = -3\), and if R is a reflection with respect to some plane, then \(\text{ tr } \,(R)=1\). Based on the above observations, we recognize R as follows:

  • if all cycles in \(\sigma \) are of length at most 2, then

    • if \(\text{ tr } \,(R) < 1\), then \(R=-\text{ Id } \,\)

    • if \(\text{ tr } \,(R) > -3\), then R is a refection with respect to the plane perpendicular to the vector \(q_i-q_{\sigma (i)}\), where i is such that \(\sigma (i)\ne i\), i.e., we take any cycle of length two

    • if neither of the above holds, then we cannot decide between \(R=-\text{ Id } \,\) and R being the reflection (this never happens in our computations)

  • if \(\sigma \) contains a cycle of length \(k>2\), then we have either \(\varphi =\frac{2\pi }{k}\) or \(\varphi =\frac{2\pi }{k/2}=\frac{4\pi }{k}\). The correct value is obtained by testing the formula (25), i.e.,

    • if only one of the following conditions holds

      $$\begin{aligned} \cos \frac{2\pi }{k} \in \frac{\text{ tr } \,(R) + 1}{2}&\quad \text{ or }\quad&\cos \frac{4\pi }{k} \in \frac{\text{ tr } \,(R) + 1}{2}, \end{aligned}$$

      then we know the value of \(\varphi \),

    • otherwise, we cannot decide between these two possibilities (this never happens in our computations).

5 On the computer-assisted proof

We normalized masses so that \(M=\sum _i m_i=1\). As in the previous section we index bodies from 0 to \(n-1\) to be in the agreement with the program. In the sequel, we study the following \({{\mathcal {RS}}}\) with \(k_1=n-2\) and \(k_2=0\) (this is the same choice as in the previous section)

$$\begin{aligned} q_i= & {} \frac{1}{m_i}f_i(q_0,\dots ,q_{n-2},q_{n-1}(q_0,\dots ,q_{n-2})), \quad i\in \{1,\dots ,n-3\}, \end{aligned}$$
(26)
$$\begin{aligned} x_{n-2}= & {} \frac{1}{m_{n-2}}f_{n-2,x}(q_0,\dots ,q_{n-2},q_{n-1}(q_0,\dots ,q_{n-2})), \end{aligned}$$
(27)
$$\begin{aligned} x_{0}= & {} \frac{1}{m_{0}}f_{0,x}(q_0,\dots ,q_{n-2},q_{n-1}(q_0,\dots ,q_{n-2})), \end{aligned}$$
(28)
$$\begin{aligned} y_{0}= & {} \frac{1}{m_{0}}f_{0,y}(q_0,\dots ,q_{n-2},q_{n-1}(q_0,\dots ,q_{n-2})), \end{aligned}$$
(29)

where

$$\begin{aligned} y_{n-2}= & {} 0, \end{aligned}$$
(30)
$$\begin{aligned} z_{n-2}= & {} 0, \end{aligned}$$
(31)
$$\begin{aligned} z_{0}= & {} 0, \end{aligned}$$
(32)
$$\begin{aligned} q_{n-1}(q_0,\dots ,q_{n-2})= & {} -\frac{1}{m_{n-1}}\sum _{i=0}^{n-2} m_i q_i, \end{aligned}$$
(33)

5.1 Equal mass case, the reduction in the configuration space for CCs

After a suitable permutation of bodies and an orthogonal transformation, it is easy to see that each CC has its equivalent in the set of the configurations satisfying the following conditions

  • \(q_{n-2}=(x_{n-2},0,0)\) is the furthermost body from the origin, \(x_{n-2}>0\),

  • \(q_0=(x_0,y_0,0)\) is the point furthest from the line OX (which is determined by \(q_{n-2}\)), \(y_0 \geqslant 0\),

  • \(q_1=(x_1,y_1,z_1)\) is the point furthest from the plane OXY (which is determined by \(q_{n-2}\) and \(q_0\)), \(z_1 \geqslant 0\),

  • all other bodies have their x coordinates in the order of increasing/decreasing indices.

This, combined with Theorem 9 and Lemma 10 in Moczurad and Zgliczyński (2019), shows that it is enough to consider the following set in which we look for the central configurations

$$\begin{aligned} 0.5 \leqslant x_{n-2}\leqslant & {} (n-1), \end{aligned}$$
(34)
$$\begin{aligned} x_{n-2}\geqslant & {} |q_i| , \quad i=0,\dots ,n-1, \end{aligned}$$
(35)
$$\begin{aligned} x_{n-2}\geqslant & {} |x_i|, \quad i=0,\dots ,n-1, \end{aligned}$$
(36)
$$\begin{aligned} y_{n-2}= & {} z_{n-2} = 0\end{aligned}$$
(37)
$$\begin{aligned} y_0\geqslant & {} |y_i| , \quad i=0,\dots ,n-1 \end{aligned}$$
(38)
$$\begin{aligned} z_0= & {} 0\end{aligned}$$
(39)
$$\begin{aligned} z_1\geqslant & {} |z_i| , \quad i=0,\dots ,n-1 \end{aligned}$$
(40)
$$\begin{aligned} x_2\geqslant & {} x_3 \geqslant \dots \geqslant x_{n-3} \geqslant x_{n-1}. \end{aligned}$$
(41)

We call this order decreasing due to the requirement (41).

5.2 Outline of the approach

In the algorithm, we look for all zeros of the reduced system (2629), which under assumptions (A1) and (A2) of Theorem 2 is equivalent to (3) for the non coplanar solutions, while (A1) is sufficient for the coplanar ones. These assumptions translate to the following conditions

$$\begin{aligned} x_{n-2}\ne & {} x_{n-1}, \end{aligned}$$
(42)
$$\begin{aligned} \det \begin{bmatrix} (x_0-x_{n-1}) &{} (x_{n-2}-x_{n-1}) \\ (y_0-y_{n-1}) &{} (y_{n-2}-y_{n-1}) \end{bmatrix}\ne & {} 0. \end{aligned}$$
(43)

Observe that condition (35) implies (42).

In the program, we verify condition (43) computing the determinant for the non-planar solutions of the reduced system.

During the program, proving the existence of a locally unique solution in a box is just as important as proving that there is no solution there. Just as in Moczurad and Zgliczyński (2019) for proving the existence, we use the Krawczyk operator applied to the system (2629). To rule out the existence of a solution, we use the exclusion tests discussed in Section 4 in Moczurad and Zgliczyński (2019) and also the Krawczyk operator.

As in Moczurad and Zgliczyński (2019) the collinearity, coplanarity and the symmetries of CCs are established by proving the uniqueness in a suitable symmetric box. (See Sects. 3.3 and 4 for details.)

5.3 The algorithm

The algorithm runs in the reduced configuration space which is a subset of \({\mathbb {R}}^{3(n-1)-3}\), i.e., a configuration is represented by a point \((x_0, y_0,x_1,y_1,z_1,\ldots , x_{n-3}, y_{n-3},z_{n-3}, x_{n-2})\). Physically, we interpret such a configuration as the positions of \(n-1\) bodies with \(q_0=(x_0,y_0,0)\), \(q_i=(x_i, y_i,z_i)\) for \(i = 1, \ldots , n-3\) and \(q_{n-2}=(x_{n-2},0, 0)\). From (33), we obtain \(q_{n-1}\)—the position of the last body.

The algorithm is the same as for \(d=2\), which was discussed in Moczurad and Zgliczyński (2019). The data types are essentially the same, with obvious modifications taking into account the dimension of the space.

5.4 Technical data

The main computations were carried out in parallel using the template function std::async (from the standard C++ library) which runs the function asynchronously (potentially in a separate thread which may be part of a thread pool) on Dell R930 4x Intel Xeon E7-8867 v3 (2,5GHz, 45MB), 1024 GB RAM. The compiler is gcc version 4.9.2 (Debian 4.9.2-10+deb8u2). Times obtained for different number of bodies are presented in Table 1.

Table 1 Comparison of execution times for different number of bodies. Third column contains total of CPU times used by all threads during the run of the program; fourth—real or clock times

5.5 Source and references files

Locations of all files related to the numerical part of the proof are specified in Table 2; these include source code of the program with installation instructions, two kinds of output files (text and binary) and Mathematica notebooks with CCs presentations.

Note that we use interval arithmetic, and therefore positions of bodies, in both: text and binary output files, are given as truncated intervals containing the true value. Positions of bodies used in Mathematica notebooks cannot be treated as reference; these notebooks are attached only for visualization.

In the text report files, all non-planar CCs are given together with the symmetries description; planar ones—are indicated by numbering collinear solution no... or planar solution no ....

In binary files, we store just the positions of bodies. These are numbers written with the accuracy obtained by the program; in our case, it is IEEE 754 double. The data can be read independently—see the function readBodiesFromBinaryFile in the source code.

Table 2 All links above refer to the directory: http://www.ii.uj.edu.pl/~zgliczyn/papers/cc-3d/

6 Spatial central configurations

The program finds all spatial configurations and writes them into output files. However, planar configurations have been already treated in Moczurad and Zgliczyński (2019), and thus here we present only non-coplanar CCs for \(n=5\) and \(n=6\). We identify CCs presenting them with U, \(J=U \sqrt{I}\) and the position of the before last body (recall that \(q_{n-2} = (x_{n-2}, 0, 0)\)) The geometric meaning of \(x_{n-2}\) is: This is the distance of the body in CC which is furthers from the center of mass, and hence this is another reasonable measure of the size of CC. We present CCs also in figures so that their shape is more visible.

6.1 Five bodies

We prove that there exist four classes of non-coplanar central configurations. This confirms the results from Lee and Santoprete (2009), Kotsireas (2000). For each non-coplanar central configuration, we find its all orthogonal symmetries. These configuration are

  • diamond with triangular base Fig. 1. Two symmetric pyramids with an equilateral triangle \(q_0q_1q_2\) as the base, the summits of the pyramids \(q_3\) and \(q_4\) are on the axis perpendicular to the base plane passing through the center of mass. This solution in discussed in (Kotsireas 2000, Sec. 5.2.3) as degree 12 solution and appears Fig. 3a in Lee and Santoprete (2009)

  • a pyramid with square base Fig. 2 , the square \(q_0q_3q_2q_4\) with \(q_1\) in the summit on the symmetry line; all triangular faces are equilateral; c indicates the center of mass and this lie over the plane of the square; diagonals of the square are marked as dashed blue lines. This is called a square pyramid in (Kotsireas 2000, Sec. 5.2.1) and appears as Fig. 3b in Lee and Santoprete (2009)

  • the regular tetrahedron (Fig. 3) \(q_0q_1q_3q_4\) with body \(q_2\) at the origin, this is regular tetrahedron from (Kotsireas 2000, Sec. 5.2.2) and appears as Fig. 4a in Lee and Santoprete (2009)

  • triangle pyramid (“perturbed tetrahedron”) (Fig. 4) with an equilateral triangle \(q_0q_1q_4\) as the base and \(q_3\) at the summit; \(q_2\) is inside; c is at the origin. We identify it with the one discussed in (Kotsireas 2000, Sec. 5.2.4) as degree 43 solution , although no geometric description of the obtained central configuration has been given there. It appears as Fig. 4b in Lee and Santoprete (2009).

Fig. 1
figure 1

Diamond with triangular base, \(n=5\)

Fig. 2
figure 2

Pyramid with square base, \(n=5\)

Fig. 3
figure 3

The regular tetrahedron, \(n=5\)

Fig. 4
figure 4

Perturbed tetrahedron, \(n=5\)

6.2 Six bodies

There are nine classes of non-coplanar central configurations:

  1. 1.

    diamond with a square base (Fig. 5), two symmetrical pyramids with a common square base \(q_0q_4q_2q_5\); \(q_1\) is the summit of the upper pyramid, \(q_3\)—the lower one. Points \(q_1\), \(q_3\) and (0, 0, 0) (the center of mass) are collinear,

  2. 2.

    diamond with regular triangular base (Fig. 6), two tetrahedrons with a common equilateral triangle base \(q_0q_1q_3\); \(q_2 = (0,0,0)\) lies at the triangle plane (i.e., bodies \(q_0\), \(q_1\), \(q_2\) and \(q_3\) are coplanar); \(q_4\) is the summit of right pyramid, \(q_5\)—the left one. Points \(q_2\), \(q_4\) and \(q_5\) are collinear,

  3. 3.

    two pyramids (one inside the other) (Fig. 7) with a common square base \(q_0q_1q_3q_5\); \(q_2\) is the summit of inner pyramid, \(q_4\)—the outer one. Points \(q_2\), \(q_4\) and (0, 0, 0) (the center of mass) are collinear,

  4. 4.

    two orthogonal isosceles triangles (Fig. 8); altitudes of both triangles, points \(q_2\), \(q_4\) and the center of mass (point (0, 0, 0)) lie on the line marked red on the picture, which is an intersection line of the planes containing the triangles,

  5. 5.

    two pyramids (one inside the other) (Fig. 9) with a common square base \(q_0q_1q_3q_5\); \(q_2\) is the summit of inner pyramid, \(q_4\)—the outer one. Points \(q_2\), \(q_4\) and (0, 0, 0) (the center of mass) are collinear,

  6. 6.

    pentagonal pyramid, (Fig. 10) a polyhedron with a regular pentagon \(q_0q_4q_2q_5q_3\) as a base; \(q_1\) is the summit and is collinear with (0, 0, 0) (the center of mass),

  7. 7.

    a prism (Fig. 11), a polyhedron with three rectangular and two triangular faces; \(q_0q_1q_4\) and \(q_2q_3q_5\) are symmetrical equilateral triangles, thus rectangles \(q_0q_1q_3q_5\), \(q_0q_4q_2q_5\) and \(q_1q_3q_2q_4\) are also of the same size—lines with the same length have the same color (red or blue),

  8. 8.

    triangular polyhedron (Fig. 12) with faces being isosceles triangles; \(q_0q_1q_4\) and \(q_0q_1q_5\) are symmetrical, \(q_0q_4q_5\) has one side longer (the segment \((q_4, q_5)\) is longer than \((q_0,q_1)\)); triangle \(q_0q_2q_3\) is also an isosceles triangle; lines with the same length have the same color (red or blue),

  9. 9.

    diamond with triangular base, (Fig. 13) two symmetrical polyhedrons with triangular base \(q_1q_4q_5\) and with summits at \(q_0\) and \(q_3\); the center of mass ((0, 0, 0)) and the point \(q_2\) lies on the plane of triangle \(q_1q_4q_5\) and points \(q_0\), \(q_3\) are symmetrical with respect to that plane (i.e., triangle \(q_0q_2q_3\) is isosceles); in the figure point c is the center of mass (0, 0, 0); lines with the same length have the same color (red, blue, magenta, green and orange).

Fig. 5
figure 5

Diamond with a square base (item 1), \(n=6\)

Fig. 6
figure 6

Diamond with regular triangular base (item 2), \(n=6\)

Fig. 7
figure 7

Two pyramids (item 3), \(n=6\)

Fig. 8
figure 8

Two orthogonal isosceles triangles (item 4) \(n=6\)

Fig. 9
figure 9

Two pyramids (item 5), \(n=6\)

Fig. 10
figure 10

Pentagonal pyramid (item 6), \(n=6\)

Fig. 11
figure 11

Prism (item 7), \(n=6\)

Fig. 12
figure 12

Triangular polyhedron (item 8), \(n=6\)

Fig. 13
figure 13

Diamond with isosceles triangular base (item 9) \(n=6\)

7 The number of non-equivalent CCs for mass parameters close to equal mass case

In this section, we count the number of CCs (different equivalency classes of CCs) for mass parameters close to the equal mass case. This is possible because for the equal mass case all CCs turned out to be non-degenerate solutions of \({{\mathcal {RS}}}\) . Then a simple continuation argument allows us to infer that the number of CCs does not change. A single CC for the equal mass case give rise (can be continued) to multiple CCs when the masses differ.

Example 1

Consider a square—one of the planar CCs for \(n=4\). We distinguish bodies by (possibly different) masses, and we show how many CCs for different masses we obtain. We use vertex labeling (coloring) for this purpose. To suggest different arrangement of masses, we use \(m_1,\ldots , m_4\) colors. We identify those colorings which result from the application of planar isometry to a CC. In Fig. 14, we present all three different arrangements of bodies for CC being a square under symmetry group \({\mathcal {O}}(2)\). Notice that there are 4! = 24 different colorings of a square by four colors \(m_1, \ldots , m_4\). However, some of them can be obtained from another by rotation or reflection. For example, arrangement \(m_4m_1m_2m_3\) can be obtained from \(m_1m_2m_3m_4\) by rotating latter by \(\pi /2\) angle.

Fig. 14
figure 14

All non-isomorphic (non-\({\mathcal {O}} (2)\) equivalent) colorings of a square with colors \(m_1, \ldots , m_4\)

If we use \({\mathcal {SO}}(2)\) as the symmetry group (we allow rotations and reject reflections), we obtain six colorings, i.e., six non-congruous CCs (see Fig. 15).

Fig. 15
figure 15

All non-congruous (not \({\mathcal {SO}}(2)\)-equivalent) colorings of a square with colors \(m_1, \ldots , m_4\)

Summarizing: if we use just rotations, then there are six non-equivalent CCs obtained from the square in the different masses case; if we additionally allow reflections, then we obtain only three non-equivalent CCs.

7.1 Definitions and Pólya’s theorem

We can count non-isomorphic colorings with the aid of Pólya’s enumeration theorem, see Biggs (1989) or Pólya et al. (1983)) for the detailed treatment and proof. In this section, we just recall the relevant definitions and state main theorems.

Let X be a finite set with \(|X| = n\) and \((G, \circ )\) be a subgroup of the group of permutations of X with \(\circ \) denoting the composition of permutations.

Coloring of X is a function \(\omega : X\rightarrow {\mathcal {C}}\), where \( {\mathcal {C}}\) is a finite set of colors. We assume that the cardinality of \({\mathcal {C}}\) is k, and it is the only important feature of \({\mathcal {C}}\). Notice that any non-negative k has a combinatorial sense, but in our counting problem, we have to take \(k = n\).

Definition 4

We say that two colorings \(\omega _1\), \(\omega _2\) are isomorphic with respect to group G, if there is a \(g\in G\) such that

$$\begin{aligned} \omega _1(g(x)) = \omega _2(x), \quad \forall x\in X. \end{aligned}$$

Definition 5

An index of a permutation \(g:X\rightarrow X\) is a function of n variables

$$\begin{aligned} \zeta _{g}\left( x_1,\ldots ,x_n \right) =x_1^{\alpha _1}\cdot x_2^{\alpha _2}\cdot \ldots \cdot x_n^{\alpha _n}, \end{aligned}$$

where \(\alpha _i\) is a number of cycles of permutation g containing exactly i elements for \(i=1,2,\ldots , n.\)

Definition 6

An index of a group G is defined as

$$\begin{aligned} \zeta _G(x_1, \ldots , x_n) = \frac{1}{|G|} \sum _{g\in G}\zeta _{g}(x_1, \ldots , x_n). \end{aligned}$$

Theorem 5

The number of non-isomorphic colorings of X with respect to group G with k colors is

$$\begin{aligned} \zeta _{G}\left( k,k,\ldots ,k \right) . \end{aligned}$$

Example 2

Let \(X = \{1 , 2, 3, 4\}\) be a square as in Fig. 14. Let \((G_{\mathrm{RR}}, \circ )\) be a group of all rotations and reflections (on a plane) of that square. Permutations in \(G_{\mathrm{RR}}\), their decomposition in two cycles and their indices are:

\(g_0\)

(1)(2)(3)(4)

\(\zeta _{g_0}(x_1, x_2, x_3, x_4) = x_1^4\)

identity

\(g_1\)

(1234)

\(\zeta _{g_1}(x_1, x_2, x_3, x_4) = x_4\)

rotation by \(\pi /2\)    (one 4-elements cycle)

\(g_2\)

(13)(24)

\(\zeta _{g_2}(x_1, x_2, x_3, x_4) = x_2^2\)

rotation by \(\pi \)    (two 2-elements cycles)

\(g_3\)

(1432)

\(\zeta _{g_3}(x_1, x_2, x_3, x_4) = x_4\)

rotation by \(3\pi /2\)    (one 4-elements cycle)

\(g_4\)

(1)(3)(24)

\(\zeta _{g_4}(x_1, x_2, x_3, x_4) = x_1^2 x_2\)

reflection with respect to line (13)    (two 1-element cycles, one 2-element cycle)

\(g_5\)

(13)(2)(4)

\(\zeta _{g_5}(x_1, x_2, x_3, x_4) = x_1^2 x_2\)

reflection with respect to line (24)

\(g_6\)

(12)(34)

\(\zeta _{g_6}(x_1, x_2, x_3, x_4) = x_2^2\)

reflection with respect to line perpendicular to the segment (12)   (two 2-elements cycles)

\(g_7\)

(14)(23)

\(\zeta _{g_7}(x_1, x_2, x_3, x_4) = x_2^2\)

reflection with respect to line perpendicular to the segment (14).

Thus \(G_{\mathrm{RR}} = \{g_0, \ldots , g_7\}\), \(|G_{\mathrm{RR}}| = 8\) and its index is

$$\begin{aligned} \zeta _{G_{\mathrm{RR}} } (x_1, x_2, x_3, x_4) = \frac{1}{8}\left( x_1^4 + 3x_2^2 + 2x_4 + 2x_1^2x_2\right) . \end{aligned}$$

Hence the number of different coloring with 4 colors is

$$\begin{aligned} \zeta _{G_{\mathrm{RR}} } (4,4,4,4) = \frac{1}{8}\left( 4^4 + 4^2 + 2\cdot 4\right) = 2360. \end{aligned}$$

In this number, there are all colorings with one, two, three, and four colors, and this is not what we are interested in. We want to know a number of different colorings with exactly four colors. For this we need full version of Pólya’s enumeration Theorem.

Theorem 6

(Pólya’s enumeration Theorem) Let D be a set of all non-isomorphic colorings of a set X with respect to G with k colors. Then the generating function \({\mathcal {U}}_{D}\)

$$\begin{aligned} {\mathcal {U}}_{D}\left( x_1,x_2,\ldots ,x_k \right) =\zeta _{G}\left( \sigma _1,\sigma _2,\ldots ,\sigma _n \right) , \end{aligned}$$

where

$$\begin{aligned} \sigma _i=x_1^i+x_2^i+\ldots +x_k^i, \end{aligned}$$

has its coefficient at \(x_1^{i_1}\ldots x_k^{i_k}\) equal to the number of non-isomorphic colorings using each of the colors \(m_1, \ldots , m_k\) exactly \(i_1, \ldots , i_k\) times, respectively.

Example 3

Consider the case from Example 2. The generating function is

$$\begin{aligned} {\mathcal {U}}(x_1, x_2, x_3, x_4)= & {} \zeta _{G_{\mathrm{RR}}}(x_1+x_2+x_3+x_4, x_1^2+x_2^2+x_3^2+x_4^2, \nonumber \\&\quad x_1^3+x_2^3+x_3^3+x_4^3, x_1^4+x_2^4+x_3^4+x_4^4) \nonumber \\= & {} \frac{1}{8} \Big \{\ (x_1+x_2+x_3+x_4)^4 \nonumber \\&+ 3(x_1^2 + x_2^2 + x_3^2 + x_4^2)^2 \nonumber \\&+ 2(x_1^4 + x_2^4 + x_3^4 +2x_4^4) \nonumber \\&+ 2(x_1 + x_2 + x_3 + x_4)^2(x_1^2 + x_2^2 + x_3^2 + x_4^2)\Big \}\nonumber \\= & {} x_1^4 + \ldots + \mathbf{3 }x_1x_2x_3x_4 + \ldots x_4^4 \end{aligned}$$
(44)

From the expansion above (44), we see that the number of non-isomorphic colorings with four colors, each used once, in the case of square is 3.

7.2 Identifications of CCs made for the main result: summary

Let us remind the essence of counting of CCs mentioned in Sect. 1.2: any CC can be obtained from one of the normalized CCs, found by the program, by a suitable composition of translation, scaling, rotation, reflection and permutation of bodies. The reasons for such situation are as follows.

  1. 1.

    First, in Sect. 2, we normalized central configurations to obtain isolated solutions. This means that we remove possibility of translation (establishing a center of mass \(c=0\)) and possibility of scaling symmetry (setting \(\lambda = 1\)). However, the obtained system (3) can have \({\mathcal {O}}(3)\) and \({\mathcal {SO}}(3)\) symmetry, which excludes the use of Krawczyk’s method. This method requires non-degenerate solutions .

  2. 2.

    In Sect. 3.2, we introduce \({{\mathcal {RS}}}\) and after these treatments solutions found by our program have no longer neither \({\mathcal {SO}}(3)\) nor \({\mathcal {O}}(3)\) symmetry. But reflections by the planes and some permutations of bodies are still possible. In the case of equal masses, two CCs that differ only by labeling of bodies are equivalent.

  3. 3.

    Thus we remove remaining symmetries and possible permutations of bodies by a procedure of unifications of solutions.

Thus, in the equal mass case we count configurations treating them as indistinct if they have the same geometrical form. In the context of the Póyla’s theorem, this corresponds to taking the permutation group G to be the whole of \({\mathcal {S}}_n\) or, equivalently, using just one color.

7.3 Number of CCs for different masses close to equal mass case

In the different masses case in the context of the Póyla’s theorem, we use exactly n colors. We consider two CCs equivalent, isomorphic or congruous, if one can be transformed into another by an element of \({\mathcal {O}}(d)\) or \({\mathcal {SO}}(d)\), respectively. Hence depending on whether we allow for reflections (i.e., \({\mathcal {O}}(d)\)) or not (\({\mathcal {SO}}(d)\)) we get different counts.

In the sequel by no-CC(n) we denote the number of CCs obtained for the equal masses case, by \(\text{ iso }(n)\) is number of different non-isomorphic CCs and the number of non-congruous CCs is denoted by \(\text{ cong }(n)\). These numbers are obtained using Pólya’s enumeration Theorem for different groups. Tables 3 and 4 contain the number of different CCs inferred from our rigorous count of CCs. Data in Tables 3 and 4 suggest that the number of different CCs in the different mass case grow faster than (n!).

Example 4

For \(n= 4\) in the planar case there are four different CCs (i.e., no-CC(4) = 4) and there are 12 non-isomorphic configurations for collinear solution, 3 for square, 4 for equilateral triangle and 12 for isosceles triangle (for detailed description of CCs see Moczurad and Zgliczyński 2019), thus iso\((4) = 12 + 3 + 12 + 4 = 31\). The count of non-congruous classes is as follows: 12 non-congruous configurations for collinear solution, 6 for square, 8 for equilateral triangle and 24 for isosceles triangle, thus \(\text{ cong }(4)=12+6+8+24=50\).

Table 3 The number of different CCs in 2D
Table 4 The number of different CCs in 3D

8 The question of stability for planar CCs

Now, consider a planar CC. It gives rise to a periodic orbit, where all bodies move on circles with the angular velocity 1. This orbit becomes a fixed point in the rotating coordinate frame. The stability/instability of this fixed point and the circular periodic orbit is the same.

Let

$$\begin{aligned} J=\left[ \begin{array}{cc} 0 &{} \quad -1 \\ 1 &{} \quad 0 \\ \end{array} \right] . \end{aligned}$$
(45)

Then \(\exp (J\theta )=\left[ \begin{array}{cc} \cos \theta &{} -\sin \theta \\ \sin \theta &{} \cos \theta \\ \end{array} \right] \) is a rotation by the angle \(\theta \) in the plane OXY.

The link between coordinates in the inertial frame \(x \in {\mathbb {R}}^2 \) and the coordinates q with respect to the rotating frame with the angular velocity equal to 1 is

$$\begin{aligned} x=\exp (Jt)q. \end{aligned}$$
(46)

The equations of motion in the rotating coordinate frame are Simò (1978)

$$\begin{aligned} {\dot{q}}_i= & {} v_i, \end{aligned}$$
(47)
$$\begin{aligned} {\dot{v}}_i= & {} -2 Jv_i + q_i - \sum _{j \ne i} \frac{m_j (q_i - q_j)}{r_{ij}^3}. \end{aligned}$$
(48)

As was mentioned earlier each planar CC with \(v_i=0\), \(i=0,\dots ,n-1\) is a fixed point of the system (47,48). We investigated numerically the linear stability of all CCs for \(n=4,5,6,7\) and for some particular non-symmetric CCs for \(n=8,9,10\) whose existence has been established in Moczurad and Zgliczyński (2019). It turns out that all these CCs are linearly unstable. The computation of eigenvalues for the linearization of (47,48) has been done non-rigorously, but we are confident that this computation can be with some effort made rigorous.