1 Introduction

In 1816, Ottaviano F. Mossotti introduced a method for initial orbit determination of a solar system body employing four optical observations, e.g., the values of right ascension and declination. Assuming geocentric observations, Mossotti’s method allows to write linear equations for the computation of the orbital angular momentum (Mossotti 1942). Then, the orbit can be reconstructed, e.g., by Gibbs’ method (Herrick 1976). This procedure has the advantage to avoid the computation of the roots of the eight degree polynomial appearing in the classical methods by Laplace (1780), Lagrange (1783), and Gauss (1809), which need only three observations but can give rise to multiple solutions. A review of the methods by Laplace and Gauss together with a geometric interpretation of the occurrence of multiple solutions can be found in Gronchi (2009), Milani and Gronchi (2010).

Mossotti’s work was appreciated by Gauss himself, see Gauss (1874). This method has been reviewed in Celletti and Pinzari (2005), where the authors state that the computation of the solution can be seriously affected by the observational errors due to the terms that are neglected in the employed approximation.

In this work, we recall Mossotti’s original method and show that it is possible to define a topocentric version which leads to a quadratic equation for the angular momentum. This generalization of the method turns out to be suitable for orbit determination of Earth satellites too. We investigate the performance of the methods and their sensitivity to observational errors by some numerical tests with simulated data: We compare the original geocentric method with this topocentric version using different time intervals between the observations and different astrometric errors. We also show a comparison test with Gauss’s method using simulated observations with the expected cadence of the VRO–LSST telescope.

2 The original method

Mossotti’s method (Mossotti 1942) leads to a set of linear equations for the components of \(\varvec{c}_\oplus -\varvec{c}\), where \(\varvec{c}_\oplus \) and \(\varvec{c}\) are the angular momenta of the Earth and a solar system body, respectively. The observations are supposed to be made from the center of the Earth. We assume that the observed body is an asteroid, moving along an elliptic Keplerian trajectory with the Sun as the center of force. We also assume that the total observational arc is covered in a much shorter time than the orbital period. This method is also suitable to be used with hyperbolic or parabolic orbits.

Here, we illustrate all the formulae which are necessary for a numerical implementation following Mossotti’s paper steps (Mossotti 1942). However, in the early nineteenth century linear algebra had not been developed yet, and several formulae in Mossotti (1942) can be written and derived in a shorter way. The original formulae can be recovered using Table 3 in “Appendix A”.

2.1 Units and preliminary definitions

In order to simplify the notation, we use the rescaled time \(\theta \), defined by

$$\begin{aligned} \theta = t\sqrt{g\left( 1+\frac{m_\oplus }{m_\odot }\right) }, \end{aligned}$$

where \(m_\oplus \) and \(m_\odot \) are the masses of the Earth and the Sun, and \(g=Gm_\odot \), where G is Newton’s gravitational constant. We also use \(\kappa \) to denote Gauss’s constant \(\sqrt{g}\). In the following, we assume that

$$\begin{aligned} \theta \approx \kappa t, \end{aligned}$$

neglecting the constant \({m_\oplus }/{m_\odot }\).

Take three of the four observations, at epochs \(t_1<t_2<t_3\), and set

$$\begin{aligned} \theta _{ij}= & {} \kappa (t_j - t_i), \\ \varvec{\theta }= & {} (\theta _{23},\,\theta _{31},\,\theta _{12})^t, \end{aligned}$$

where the superscript t stands for transposition. We write \(\varvec{c}\) and \(\varvec{c}_\oplus \) for the orbital angular momenta of the asteroid and the Earth, respectively, and introduce their unit vectors

$$\begin{aligned} {\hat{\varvec{c}}} = \frac{\varvec{c}}{c}, \quad {\hat{\varvec{c}}}_\oplus = \frac{\varvec{c}_\oplus }{c_\oplus }, \end{aligned}$$

where \(c=|\varvec{c}|\), \(c_\oplus =|\varvec{c}_\oplus |\), being \(|\varvec{x}|\) the Euclidean norm of a vector \(\varvec{x}\).

We also write \(\varvec{r}_i\) and \(\varvec{q}_i\) for the heliocentric positions of the asteroid and the Earth at the three epochs \(t_i\) (\(i=1,2,3\)), use \({\varvec{\rho }}_i = \varvec{r}_i - \varvec{q}_i\) for the geocentric position of the asteroid, and introduce the unit vectors

$$\begin{aligned} \hat{\varvec{q}}_i = \frac{\varvec{q}_i}{q_i},\quad {\hat{\varvec{\rho }}}_i = \frac{{\varvec{\rho }}_i}{\rho _i},\quad i=1,2,3, \end{aligned}$$

where \(q_i=|\varvec{q}_i|\), \(\rho _i=|{\varvec{\rho }}_i|\). Finally, we denote the parametersFootnote 1 of the orbits of the asteroid and the Earth by p, \(p_\oplus \). They are defined by

$$\begin{aligned} p\kappa ^2 = c^2,\quad p_\oplus \kappa ^2 = c_\oplus ^2. \end{aligned}$$

2.2 Geometric relations

With the purpose of writing Mossotti’s equations in a compact form, let us introduce the matrices

$$\begin{aligned} P = (\varvec{\rho }_1\, |\, \varvec{\rho }_2 \,|\, \varvec{\rho }_3),\quad Q = (\varvec{q}_1\, |\, \varvec{q}_2 \,|\, \varvec{q}_3),\quad R = P + Q = (\varvec{r}_1\, |\, \varvec{r}_2 \,|\, \varvec{r}_3), \end{aligned}$$
(1)

where \((\varvec{x}_1 \,|\, \varvec{x}_2 \,|\, \varvec{x}_3)\) is the matrix whose columns are the vectors \(\varvec{x}_j\). The corresponding adjugate matrices are

$$\begin{aligned} \begin{aligned} \text {adj}(P)&= \bigl (\varvec{\rho }_2\times \varvec{\rho }_3\,|\, \varvec{\rho }_3\times \varvec{\rho }_1\,|\, \varvec{\rho }_1\times \varvec{\rho }_2\bigr )^t,\\ \text {adj}(Q)&= \bigl (\varvec{q}_2\times \varvec{q}_3\,|\, \varvec{q}_3\times \varvec{q}_1\,|\, \varvec{q}_1\times \varvec{q}_2\bigr )^t,\\ \text {adj}(R)&= \bigl (\varvec{r}_2\times \varvec{r}_3\,|\, \varvec{r}_3\times \varvec{r}_1\,|\, \varvec{r}_1\times \varvec{r}_2\bigr )^t.\\ \end{aligned} \end{aligned}$$

We recall the following property, which holds for any square matrix M:

$$\begin{aligned} M\,\text {adj}(M) = \text {adj}(M)\,M = \det (M)\,I, \end{aligned}$$
(2)

where I is the identity matrix.

Fig. 1
figure 1

Geometry of the three observations

The rank of the matrices Q and R is 2, since each triplet \(\{\varvec{q}_1, \varvec{q}_2,\varvec{q}_3\}\) and \(\{\varvec{r}_1, \varvec{r}_2, \varvec{r}_3\}\) is made by coplanar vectors and we assume that

$$\begin{aligned} \varvec{q}_i\times \varvec{q}_j\ne \varvec{0}, \quad \varvec{r}_i\times \varvec{r}_j\ne \varvec{0}, \quad 1\le i<j\le 3, \end{aligned}$$

see Fig. 1. This implies

$$\begin{aligned} Q\,\text {adj}(Q)= 0, \quad R\,\text {adj}(R) = 0. \end{aligned}$$
(3)

Since the angular momenta \(\varvec{c}\), \(\varvec{c}_\oplus \) are, respectively, orthogonal to the orbital planes of the asteroid and the Earth, we have

$$\begin{aligned} \varvec{c}\cdot \varvec{r}_i = \varvec{c}\cdot (\varvec{q}_i+{\varvec{\rho }}_i)= 0,\quad \varvec{c}_\oplus \cdot \varvec{q}_i = 0, \end{aligned}$$

which lead us to

$$\begin{aligned} \varvec{c}\cdot {\varvec{\rho }}_i = (\varvec{c}_\oplus -\varvec{c})\cdot \varvec{q}_i, \end{aligned}$$
(4)

for \(i=1,2,3\). Then, we introduce the vectors

$$\begin{aligned} \varvec{\tau }=(\tau _1,\,\tau _2,\,\tau _3)^t,\quad \varvec{T}=(T_1,\,T_2,\,T_3)^t, \end{aligned}$$

with

$$\begin{aligned}&\tau _1\sqrt{p} = {\varvec{r}}_2\times {\varvec{r}}_3\cdot {\hat{\varvec{c}}}, \quad T_1\sqrt{p_\oplus } = \varvec{q}_2\times \varvec{q}_3\cdot {\hat{\varvec{c}}}_\oplus , \end{aligned}$$
(5a)
$$\begin{aligned}&\tau _2\sqrt{p} = {\varvec{r}}_3\times {\varvec{r}}_1\cdot {\hat{\varvec{c}}}, \quad T_2\sqrt{p_\oplus } = \varvec{q}_3\times \varvec{q}_1\cdot {\hat{\varvec{c}}}_\oplus , \end{aligned}$$
(5b)
$$\begin{aligned}&\tau _3\sqrt{p} = {\varvec{r}}_1\times {\varvec{r}}_2\cdot {\hat{\varvec{c}}}, \quad T_3\sqrt{p_\oplus } = \varvec{q}_1\times \varvec{q}_2\cdot {\hat{\varvec{c}}}_\oplus . \end{aligned}$$
(5c)

Noting that

$$\begin{aligned} \varvec{\tau } = \frac{1}{\sqrt{p}} \text {adj}(R)\hat{\varvec{c}},\quad \varvec{T} = \frac{1}{\sqrt{p_\oplus }} \text {adj}(Q)\hat{\varvec{c}}_\oplus , \end{aligned}$$
(6)

by (3) we have

$$\begin{aligned} R\,\varvec{\tau } = \varvec{0}, \quad Q\,\varvec{T} = \varvec{0}. \end{aligned}$$

Moreover, recalling that \(R = Q + P\), we get

$$\begin{aligned} P\varvec{\tau } = Q(\varvec{T}-\varvec{\tau }). \end{aligned}$$
(7)

Since \((\varvec{r}_i\times \varvec{r}_j)\times \varvec{c}= (\varvec{q}_i\times \varvec{q}_j)\times \varvec{c}_\oplus = {\varvec{0}}\), we have

$$\begin{aligned} \left( \varvec{r}_i\times \varvec{r}_j\cdot \hat{\varvec{c}}\right) \left( \hat{\varvec{c}}\cdot \varvec{\rho }_i\right)&= \varvec{r}_i\times \varvec{r}_j\cdot \varvec{\rho }_i = \varvec{q}_i\times \varvec{q}_j \cdot \varvec{\rho }_i - \varvec{\rho }_i\times \varvec{\rho }_j \cdot \varvec{q}_i, \end{aligned}$$
(8)
$$\begin{aligned} (\varvec{q}_i\times \varvec{q}_j\cdot {\hat{\varvec{c}}}_\oplus )({\hat{\varvec{c}}}_\oplus \cdot \varvec{\rho }_i)&= \varvec{q}_i\times \varvec{q}_j\cdot \varvec{\rho }_i. \end{aligned}$$
(9)

Remark 1

If the orbits of the Earth and the asteroid are almost coplanar, then Eqs. (8), (9) are close to be degenerate. This singularity is a common feature of orbit determination methods, where the value of the geodesic curvature of the observed arc plays an important role, see Milani and Gronchi (2010), Chap. 9.

The last equality in (8) is obtained by observing that

$$\begin{aligned} \varvec{r}_i\times \varvec{r}_j \cdot \varvec{\rho }_i = \varvec{q}_i\times \varvec{r}_j \cdot \varvec{\rho }_i = \varvec{q}_i\times \varvec{q}_j \cdot \varvec{\rho }_i + \varvec{q}_i\times \varvec{\rho }_j \cdot \varvec{\rho }_i. \end{aligned}$$

By (8), (9) and the definitions of the \(\tau _k\), \(T_k\) given in (5), we obtain

$$\begin{aligned} \varepsilon _{ijk}\frac{\tau _k}{\kappa }\varvec{c}\cdot \varvec{\rho }_i&= \varvec{q}_i\times \varvec{q}_j \cdot \varvec{\rho }_i - \varvec{\rho }_i\times \varvec{\rho }_j \cdot \varvec{q}_i, \end{aligned}$$
(10)
$$\begin{aligned} \varepsilon _{ijk}\frac{T_k}{\kappa }\varvec{c}_\oplus \cdot \varvec{\rho }_i&= \varvec{q}_i\times \varvec{q}_j \cdot \varvec{\rho }_i, \end{aligned}$$
(11)

where \(\varepsilon _{ijk}\) denotes the Levi–Civita symbol, and the indexes ijk vary so that all the 6 permutations of the set \(\{1,2,3\}\) can be considered.

Subtracting (10) from (11), we obtain

$$\begin{aligned} \varepsilon _{ijk}\frac{1}{\kappa }(T_k\varvec{c}_\oplus -\tau _k\varvec{c})\cdot \varvec{\rho }_i = \varvec{\rho }_i\times \varvec{\rho }_j \cdot \varvec{q}_i. \end{aligned}$$
(12)

Writing

$$\begin{aligned} T_k\varvec{c}_\oplus - \tau _k\varvec{c} = T_k\left( \varvec{c}_\oplus -\varvec{c}\right) +\left( T_k-\tau _k\right) \varvec{c}, \end{aligned}$$

and using (4), we have

$$\begin{aligned} \begin{aligned} \left[ T_k\left( \varvec{c}_\oplus -\varvec{c}\right) +\left( T_k-\tau _k\right) \varvec{c}\right] \cdot \varvec{\rho }_i&= T_k\left( \varvec{c}_\oplus -\varvec{c}\right) \cdot \varvec{\rho }_i +\left( T_k-\tau _k\right) \left( \varvec{c}_\oplus -\varvec{c}\right) \cdot \varvec{q}_i \\&= \left( \varvec{c}_\oplus -\varvec{c}\right) \cdot \left[ \hat{\varvec{\rho }}_i +\left( 1-\frac{\tau _k}{T_k}\right) \frac{q_i}{\rho _i}\hat{\varvec{q}}_i\right] T_k\rho _i. \end{aligned} \end{aligned}$$

Therefore, introducing the coefficients

$$\begin{aligned} \alpha _{ik} = \left( 1-\frac{\tau _k}{T_k}\right) \frac{q_i}{\rho _i}, \end{aligned}$$
(13)

and simplifying \(\rho _i\) we can write relations (12) as

$$\begin{aligned} \varepsilon _{ijk}\left( \varvec{c}_\oplus -\varvec{c}\right) \cdot \left( \hat{\varvec{\rho }}_i+\alpha _{ik}\hat{\varvec{q}}_i\right) T_k = \kappa \rho _j\hat{\varvec{\rho }}_i\times \hat{\varvec{\rho }}_j\cdot \varvec{q}_i. \end{aligned}$$
(14)

2.3 Combining geometry of observations with two-body dynamics

Since the time intervals \(\theta _{12}\), \(\theta _{23}\) are small compared to the orbital period,Footnote 2 we can consider Taylor’s expansions of the position of the asteroid and the center of the Earth in their orbital planes as power series of \(\theta _{23}\), \(\theta _{31}\), \(\theta _{12}\) centering the expansion at the intermediate epoch. Then, we neglect the terms depending on the powers of \(\theta _{ij}\) greater than 3. In this way, we obtain

$$\begin{aligned} \varvec{\tau } \simeq \varvec{\theta } - \frac{1}{6r_2^3}\varvec{\theta }^3, \quad \varvec{T} \simeq \varvec{\theta } - \frac{1}{6q_2^3}\varvec{\theta }^3, \end{aligned}$$
(15)

where \(r_2=|\varvec{r}_2|\) and

$$\begin{aligned} \varvec{\theta }^3 = \varvec{\theta } \odot \varvec{\theta } \odot \varvec{\theta } = \left( \theta _{23}^3,\,\theta _{31}^3,\,\theta _{12}^3\right) ^t, \end{aligned}$$

with \(\odot \) denoting the Hadamard product.Footnote 3

Let us define

$$\begin{aligned} \varvec{u}=(u_1,\,u_2,\,u_3)^t=\text {adj}(\hat{P})Q{\varvec{\theta }^3}, \end{aligned}$$

with \(\hat{P} = (\hat{\varvec{\rho }}_1\, |\, \hat{\varvec{\rho }}_2 \,|\, \hat{\varvec{\rho }}_3)\). Multiplying relation (7) on the left by \(\text {adj}(\hat{P})\), and taking into account the approximations (15), we get

$$\begin{aligned} \begin{aligned} \text {adj}(\hat{P})P\varvec{\theta }&\simeq \text {adj}(\hat{P})P\varvec{\tau } = \text {adj}(\hat{P})Q(\varvec{T}-\varvec{\tau })\\&\simeq \text {adj}(\hat{P})Q\varvec{\theta }^3\left( \frac{1}{6r_2^3} - \frac{1}{6q_2^3}\right) = \frac{1}{6}\left( \frac{1}{r_2^3} - \frac{1}{q_2^3}\right) \varvec{u}. \end{aligned} \end{aligned}$$
(16)

Recalling (2), and noting that

$$\begin{aligned} \text {adj}(\hat{P})P = \det (\hat{P})\text {diag}\{\rho _1,\rho _2,\rho _3\}, \end{aligned}$$

relation (16) yields

$$\begin{aligned} \det (\hat{P})\varvec{\delta }\odot \varvec{\theta } \simeq \frac{1}{6}\left( \frac{1}{r_2^3} - \frac{1}{q_2^3}\right) \varvec{u}, \end{aligned}$$
(17)

with

$$\begin{aligned} \varvec{\delta } = (\rho _1,\rho _2,\rho _3)^t. \end{aligned}$$

2.4 A linear equation involving \(\varvec{c}_\oplus -\varvec{c}\)

Choosing \((i,j,k)=(1,2,3), (3,2,1)\) in (14) and eliminating \(\kappa \rho _2\) from the two resulting equations, we obtain

$$\begin{aligned} \begin{aligned}&\left( \hat{\varvec{\rho }}_1\times \hat{\varvec{\rho }}_2 \cdot \varvec{q}_1\right) \left[ \left( \varvec{c}_\oplus -\varvec{c}\right) \cdot \left( \hat{\varvec{\rho }}_3 + \alpha _{31}\hat{\varvec{q}}_3\right) \right] T_1 \\&\quad = \left( \hat{\varvec{\rho }}_2\times \hat{\varvec{\rho }}_3 \cdot \varvec{q}_3\right) \left[ \left( \varvec{c}_\oplus -\varvec{c}\right) \cdot \left( \hat{\varvec{\rho }}_1 + \alpha _{13}\hat{\varvec{q}}_1\right) \right] T_3. \end{aligned} \end{aligned}$$
(18)

In this equation, the only unknowns different from \(\left( \varvec{c}_\oplus -\varvec{c}\right) \) are the coefficients \(\alpha _{13}\) and \(\alpha _{31}\). Using the approximations of \(\varvec{\tau }\) and \(\varvec{T}\) given by (15) in (13), we have

$$\begin{aligned} \alpha _{13} = \frac{q_1}{\rho _1}\frac{T_3-\tau _3}{T_3} \simeq \frac{q_1}{\rho _1T_3} \frac{\theta _{12}^3}{6}\left( \frac{1}{r_2^3}-\frac{1}{q_2^3}\right) \simeq \frac{q_1}{\rho _1} \frac{\theta _{12}^2}{6}\left( \frac{1}{r_2^3}-\frac{1}{q_2^3}\right) , \end{aligned}$$
(19)

where we used \(\frac{1}{T_3} = \frac{1}{\theta _{12}}(1+O(\theta _{12}))\). In a similar way, we obtain

$$\begin{aligned} \alpha _{31} \simeq \frac{q_3}{\rho _3} \frac{\theta _{23}^2}{6}\left( \frac{1}{r_2^3}-\frac{1}{q_2^3}\right) . \end{aligned}$$
(20)

Inserting the approximations of \(\rho _1, \rho _3\) given by (17) into (19), (20) we can express \(\alpha _{13}\), \(\alpha _{31}\) with known quantities:

$$\begin{aligned} \alpha _{13} \simeq \frac{\det (\hat{P})q_1{\theta _{12}^{2}\theta _{23}}}{u_1}, \quad \alpha _{31} \simeq \frac{\det (\hat{P})q_3{\theta _{23}^2\theta _{12}}}{u_3}. \end{aligned}$$
(21)

Defining the coefficients

$$\begin{aligned} a_{1} = \frac{q_2\left( \hat{\varvec{\rho }}_2\times \hat{\varvec{\rho }}_3 \cdot \varvec{q}_3\right) }{T_1\sqrt{p_\oplus }} = \frac{\hat{\varvec{\rho }}_2\times \hat{\varvec{\rho }}_3 \cdot \hat{\varvec{q}}_3}{\hat{\varvec{q}}_2\times \hat{\varvec{q}}_3 \cdot \hat{\varvec{c}}_\oplus }, \quad a_{3} = \frac{q_2\left( \hat{\varvec{\rho }}_1\times \hat{\varvec{\rho }}_2 \cdot \varvec{q}_1\right) }{T_3\sqrt{p_\oplus }} = \frac{\hat{\varvec{\rho }}_1\times \hat{\varvec{\rho }}_2 \cdot \hat{\varvec{q}}_1}{\hat{\varvec{q}}_1\times \hat{\varvec{q}}_2 \cdot \hat{\varvec{c}}_\oplus }, \end{aligned}$$

and the vectors

$$\begin{aligned} \varvec{\gamma } = a_{1}\left( \hat{\varvec{\rho }}_1 + \alpha _{13}\hat{\varvec{q}}_1\right) ,\quad \varvec{\varphi } = a_{3}\left( \hat{\varvec{\rho }}_3 + \alpha _{31}\hat{\varvec{q}}_3\right) , \end{aligned}$$

Equation (18) becomes

$$\begin{aligned} (\varvec{\gamma }-\varvec{\varphi })\cdot \left( \varvec{c}_\oplus -\varvec{c}\right) = 0. \end{aligned}$$
(22)

Remark 2

Choosing the index pair \(\{(2,3,1),(1,3,2)\}\) or \(\{(3,1,2),(2,1,3)\}\) in (14) we can obtain two additional equations analogous to (22). However, with the employed approximation of \(\varvec{T}\) and \(\varvec{\tau }\), the three equations are just the same (see Mossotti 1942, Sect. 28).

2.5 Mossotti’s equations for \(\varvec{c}_\oplus -\varvec{c}\)

With the aim of writing two independent linear equations, all the four observations are used. If we consider two different choices of the three observations, out of the available four, we obtain the system

$$\begin{aligned} \left\{ \begin{aligned}&(\varvec{\gamma }_1-\varvec{\varphi }_1)\cdot \left( \varvec{c}_\oplus -\varvec{c}\right) = 0\\&(\varvec{\gamma }_2-\varvec{\varphi }_2)\cdot \left( \varvec{c}_\oplus -\varvec{c}\right) = 0, \end{aligned} \right. \end{aligned}$$
(23)

where the subscripts 1, 2 of \(\varvec{\gamma }\) and \(\varvec{\varphi }\) refer to the two triplets of observations. Set

$$\begin{aligned} \varvec{w} = (\varvec{\gamma }_1-\varvec{\varphi }_1)\times (\varvec{\gamma }_2-\varvec{\varphi }_2) \end{aligned}$$

and assume \(\varvec{w}\ne \varvec{0}\). Then, the general solution of (23) has the form

$$\begin{aligned} \varvec{c}_\oplus - \varvec{c} = \lambda \varvec{w}, \end{aligned}$$
(24)

with \(\lambda \in \mathbb {R}\), giving the direction of \(\varvec{c}_\oplus - \varvec{c}\).

Remark 3

If \(\varvec{\gamma }_1-\varvec{\varphi }_1\) and \(\varvec{\gamma }_2-\varvec{\varphi }_2\) are almost parallel, then system (23) is almost degenerate: we can try to avoid this singularity by choosing other triplets of observations.

In order to constrain the values of \(\lambda \) we proceed as follows. Choosing \((i,j,k) = (1,2,3)\) in Eq. (14), we have

$$\begin{aligned} \frac{a_3 c_\oplus \rho _2}{q_2} = (\varvec{c}_\oplus -\varvec{c})\cdot \left( \hat{\varvec{\rho }}_1+\alpha _{13}\hat{\varvec{q}}_1\right) . \end{aligned}$$
(25)

Inserting the expression (24) of the general solution in (25) and (4) with \(i=2\) we obtain, respectively,

$$\begin{aligned} \rho _2 = \lambda \frac{q_2}{a_1a_3c_\oplus }\varvec{w}\cdot \varvec{\gamma }_1, \end{aligned}$$
(26)

and

$$\begin{aligned} \lambda \varvec{w}\cdot \varvec{q}_2 = (\varvec{c}_\oplus - \lambda \varvec{w})\cdot \varvec{\rho }_2 = (\varvec{c}_\oplus - \lambda \varvec{w})\cdot \hat{\varvec{\rho }}_2\rho _2. \end{aligned}$$
(27)

Substituting the expression (26) of \(\rho _2\) in (27) yields a quadratic equation in \(\lambda \), which is here the only unknown:

$$\begin{aligned} \lambda \varvec{w}\cdot \varvec{q}_2 = \lambda \frac{q_2}{a_1a_3 c_\oplus }\left( \varvec{w}\cdot \varvec{\gamma }_1\right) \big [(\varvec{c}_\oplus - \lambda \varvec{w})\cdot \hat{\varvec{\rho }}_2\big ]. \end{aligned}$$

This equation can be written as

$$\begin{aligned} (\varvec{w}\cdot \varvec{\gamma }_1)(\varvec{w}\cdot \hat{\varvec{\rho }}_2) \lambda ^2 + \big [a_1a_3c_\oplus (\varvec{w}\cdot \hat{\varvec{q}}_2)-(\varvec{w}\cdot \varvec{\gamma }_1) (\varvec{c}_\oplus \cdot \hat{\varvec{\rho }}_2)\big ]\lambda = 0, \end{aligned}$$
(28)

whose solutions are

$$\begin{aligned}&\lambda = 0, \end{aligned}$$
(29a)
$$\begin{aligned}&\lambda = \frac{(\varvec{w}\cdot \varvec{\gamma }_1)(\varvec{c}_\oplus \cdot \hat{\varvec{\rho }}_2) - a_1a_3c_\oplus (\varvec{w}\cdot \hat{\varvec{q}}_2)}{(\varvec{w}\cdot \varvec{\gamma }_1)(\varvec{w}\cdot \hat{\varvec{\rho }}_2)}. \end{aligned}$$
(29b)

Substituting these expressions in (24) gives two possible values of the angular momentum \(\varvec{c}\). The solution (29a) yields \(\varvec{c}_\oplus =\varvec{c}\) and is usually discarded, so that (29b) is regarded as the only solution. In this way, the equations of Mossotti’s method can be considered linear.

3 Topocentric method

We first introduce some notation. Let us define \(\varvec{q}_\oplus \), \(\varvec{p}_\mathrm{obs}\) as the heliocentric position of the Earth center, and the geocentric position of the observer, respectively. The heliocentric positions of the observer and asteroid are

$$\begin{aligned} \begin{aligned}&\varvec{q}=\varvec{q}_\oplus +\varvec{p}_\mathrm{obs},\\&{\varvec{r}}= \varvec{q}+ {\varvec{\rho }}= \varvec{q}_\oplus + {\varvec{\rho }}_\mathrm{geo},\\ \end{aligned} \end{aligned}$$

where \({\varvec{\rho }}\), \({\varvec{\rho }}_\mathrm{geo}\) are the topocentric and geocentric positions of the asteroid, respectively (see Fig. 2).

Fig. 2
figure 2

Geocentric and topocentric point of view

3.1 Geometric relations

As in the geocentric case, we select three observations of the asteroid out of the available four, and introduce the matrices P, Q, R as in (1), but with a different interpretation for the vectors \(\varvec{\rho }\) and \(\varvec{q}\), where \(\varvec{\rho }\) represents the topocentric position of the asteroid, and \(\varvec{q}\) gives the heliocentric position of the observer. Moreover, we introduce the matrices

$$\begin{aligned} Q_\oplus = {\bigl (\varvec{q}_{\oplus ,1}\, |\, \varvec{q}_{\oplus ,2}\, |\, \varvec{q}_{\oplus ,3}\bigr )}, \quad P_\text {obs} = {\bigl (\varvec{p}_{\text {obs},1}\, |\, \varvec{p}_{\text {obs},2}\, |\, \varvec{p}_{\text {obs},3}\bigr )}. \end{aligned}$$

We recall the geometrical relations

$$\begin{aligned} \varvec{c}\cdot \varvec{r}_i = \varvec{c}\cdot (\varvec{q}_i+{\varvec{\rho }}_i)= 0,\quad \varvec{c}_\oplus \cdot \varvec{q}_{\oplus ,i} = \varvec{c}_\oplus \cdot (\varvec{q}_i - \varvec{p}_{\text {obs},i}) = 0, \end{aligned}$$

which lead us to

$$\begin{aligned} \varvec{c}\cdot {\varvec{\rho }}_i = (\varvec{c}_\oplus -\varvec{c})\cdot \varvec{q}_i - \varvec{c}_\oplus \cdot \varvec{p}_{\text {obs},i}, \end{aligned}$$
(30)

for \(i=1,2,3\). We also introduce the quantities

$$\begin{aligned} \varvec{\tau } = \frac{1}{\sqrt{p}} \text {adj}(R)\hat{\varvec{c}}, \quad \varvec{T} = \frac{1}{\sqrt{p_\oplus }} \text {adj}(Q_\oplus )\hat{\varvec{c}}_\oplus , \end{aligned}$$
(31)

which are the same as in (6), and define the matrix

$$\begin{aligned} \mathcal {C} = \left[ \text {adj}(Q)-\text {adj}(Q_\oplus )\right] \hat{P}. \end{aligned}$$
(32)

Like in the geocentric case, we have the relations

$$\begin{aligned} \varepsilon _{ijk}\frac{\tau _k}{\kappa }\varvec{c}\cdot \varvec{\rho }_i&= \varvec{q}_i\times \varvec{q}_j \cdot \varvec{\rho }_i - \varvec{\rho }_i\times \varvec{\rho }_j \cdot \varvec{q}_i, \end{aligned}$$
(33)
$$\begin{aligned} \varepsilon _{ijk}\frac{T_k}{\kappa }\varvec{c}_\oplus \cdot \varvec{\rho }_i + {{\mathcal {C}}}_{ki}\rho _i&= \varvec{q}_i\times \varvec{q}_j \cdot \varvec{\rho }_i, \end{aligned}$$
(34)

where (ijk) is varied so that all the 6 permutations of the set \(\{1,2,3\}\) are considered, and \(\mathcal {C}_{ki}\) denotes the element of the k-th row and i-th column of the matrix \(\mathcal {C}\).

Subtracting (33) from (34), we obtain

$$\begin{aligned} \varepsilon _{ijk}\frac{1}{\kappa }(T_k\varvec{c}_\oplus -\tau _k\varvec{c})\cdot \varvec{\rho }_i + {{\mathcal {C}}}_{ki}\rho _i= \varvec{\rho }_i\times \varvec{\rho }_j \cdot \varvec{q}_i, \end{aligned}$$
(35)

and following the same procedure as in the geocentric case, we can write (35) as

$$\begin{aligned} \varepsilon _{ijk}\left[ \left( \varvec{c}_\oplus -\varvec{c}\right) \cdot \left( \hat{\varvec{\rho }}_i+\alpha _{ik}\hat{\varvec{q}}_i\right) + \frac{\kappa \mathcal {C}_{ki}}{T_k}-\frac{\alpha _{ik}}{q_i}\varvec{c}_\oplus \cdot \varvec{p}_{\text {obs},i}\right] T_k = \kappa \rho _j\hat{\varvec{\rho }}_i\times \hat{\varvec{\rho }}_j\cdot \varvec{q}_i, \end{aligned}$$
(36)

where the coefficients \(\alpha _{ik}\) are defined as in (13), with the new interpretation for \(\rho _i\) and \(q_i\).

Using the same approximations as in (15), we also note that

$$\begin{aligned} \begin{aligned} \text {adj}(\hat{P})P\varvec{\theta }&\simeq \text {adj}(\hat{P})P\varvec{\tau } = \text {adj}(\hat{P})Q_\oplus (\varvec{T}-\varvec{\tau }) - \text {adj}(\hat{P})P_\text {obs}\varvec{\tau }\\&\simeq \frac{1}{6}\bigg (\frac{1}{r_2^3} - \frac{1}{q_{\oplus ,2}^3}\bigg )\varvec{u} - \text {adj}(\hat{P})P_\text {obs}\varvec{\theta }, \end{aligned} \end{aligned}$$
(37)

where \(q_{\oplus ,2} = |\varvec{q}_{\oplus ,2}|\) and

$$\begin{aligned} \varvec{u} = (u_1,u_2,u_3)^t = \text {adj}(\hat{P})Q_\oplus \varvec{\theta }^3. \end{aligned}$$

The presence of the term \(\text {adj}(\hat{P})P_\text {obs}\varvec{\theta }\) in (37) prevents us from making the same simplification that allowed to express the \(\alpha _{ik}\) as functions of known quantities. Noting that

$$\begin{aligned} \text {adj}(\hat{P})P_\text {obs} = \mathcal {O}(p_\text {obs}), \end{aligned}$$

where \(p_{\text {obs}} = |\varvec{p}_{\text {obs}}|\), we neglect this term and obtain

$$\begin{aligned} \det (\hat{P})\varvec{\delta }\odot \varvec{\theta } \simeq \frac{1}{6}\left( \frac{1}{r_2^3} - \frac{1}{q_{\oplus ,2}^3}\right) \varvec{u}, \end{aligned}$$

with

$$\begin{aligned} \varvec{\delta } = (\rho _1,\rho _2,\rho _3)^t. \end{aligned}$$

As a consequence, the expressions for \(\alpha _{13}\) and \(\alpha _{31}\) given in (21) can still be used, with the new interpretation for the vectors \(\varvec{\rho }\) and \(\varvec{q}\).

3.2 A linear equation involving \(\varvec{c}_\oplus -\varvec{c}\)

Choosing \((i,j,k)=(1,2,3), (3,2,1)\) in (36) and eliminating \(\kappa \rho _2\) from the resulting equations, we obtain

$$\begin{aligned} \begin{aligned}&a_3 \left[ \left( \varvec{c}_\oplus -\varvec{c}\right) \cdot \left( \hat{\varvec{\rho }}_3 +\alpha _{31}\hat{\varvec{q}}_3\right) + \frac{\kappa \mathcal {C}_{13} }{T_1} -\frac{\alpha _{31}}{q_3}\varvec{c}_\oplus \cdot \varvec{p}_{\text {obs},3} \right] \\&\quad = a_1\left[ (\varvec{c}_\oplus -\varvec{c})\cdot \left( \hat{\varvec{\rho }}_1 +\alpha _{13}\hat{\varvec{q}}_1\right) + \frac{\kappa \mathcal {C}_{31}}{T_3} -\frac{\alpha _{13}}{q_1}\varvec{c}_\oplus \cdot \varvec{p}_{\text {obs},1} \right] , \end{aligned} \end{aligned}$$
(38)

with

$$\begin{aligned}&a_{1} = \frac{\left( \hat{\varvec{\rho }}_2\times \hat{\varvec{\rho }}_3 \cdot \varvec{q}_3 \right) {q}_2}{T_1\sqrt{p_\oplus }} = \frac{\left( \hat{\varvec{\rho }}_2\times \hat{\varvec{\rho }}_3 \cdot \varvec{q}_3 \right) {q}_2}{\varvec{q}_{\oplus ,2}\times \varvec{q}_{\oplus ,3} \cdot \hat{\varvec{c}}_\oplus } , \nonumber \\&a_{3} = \frac{\left( \hat{\varvec{\rho }}_1\times \hat{\varvec{\rho }}_2 \cdot \varvec{q}_1 \right) {q}_2}{T_3\sqrt{p_\oplus }} = \frac{\left( \hat{\varvec{\rho }}_1\times \hat{\varvec{\rho }}_2 \cdot \varvec{q}_1 \right) {q}_2}{\varvec{q}_{\oplus ,1}\times \varvec{q}_{\oplus ,2} \cdot \hat{\varvec{c}}_\oplus }. \end{aligned}$$
(39)

Like in the geocentric case, defining the vectors

$$\begin{aligned} \varvec{\gamma } = a_{1}\left( \hat{\varvec{\rho }}_1 + \alpha _{13}\hat{\varvec{q}}_1\right) ,\quad \varvec{\varphi } = a_{3}\left( \hat{\varvec{\rho }}_3 + \alpha _{31}\hat{\varvec{q}}_3\right) , \end{aligned}$$

we can write Eq. (38) as

$$\begin{aligned} (\varvec{\gamma }-\varvec{\varphi })\cdot \left( \varvec{c}_\oplus -\varvec{c}\right) = D, \end{aligned}$$
(40)

with

$$\begin{aligned} \begin{aligned} D&= \kappa \left( \frac{a_3\mathcal {C}_{13}}{T_1} - \frac{a_1\mathcal {C}_{31}}{T_3}\right) +\left( \frac{a_1\alpha _{13}}{q_1}\varvec{p}_{\text {obs},1} - \frac{a_3\alpha _{31}}{q_3}\varvec{p}_{\text {obs},3} \right) \cdot \varvec{c}_\oplus . \end{aligned} \end{aligned}$$

Remark 4

We can add to (40) two equations choosing the index pairs \(\{(2,3,1),(1,3,2)\}\) and \(\{(3,1,2),(2,1,3)\}\) in (36). In this way, we can write a linear system of three equations for \(\varvec{c}_\oplus -\varvec{c}\) using only three observations. However, we expect that the matrix of this system is ill-conditioned because \(Q = Q_\oplus + \mathcal {O}(p_\mathrm{obs})\), so that the vectors \(\varvec{q}_1, \varvec{q}_2, \varvec{q}_3\) are almost coplanar.

3.3 Equations of the topocentric method

Following Sect. 2.5, if we consider two different choices of the three observations, we obtain the system

$$\begin{aligned} \left\{ \begin{aligned}&(\varvec{\gamma }_1-\varvec{\varphi }_1)\cdot \left( \varvec{c}_\oplus -\varvec{c}\right) = D_1\\&(\varvec{\gamma }_2-\varvec{\varphi }_2)\cdot \left( \varvec{c}_\oplus -\varvec{c}\right) = D_2. \end{aligned} \right. \end{aligned}$$
(41)

Set

$$\begin{aligned} \varvec{w} = (\varvec{\gamma }_1-\varvec{\varphi }_1)\times (\varvec{\gamma }_2-\varvec{\varphi }_2), \end{aligned}$$

and assume \(\varvec{w}\ne \varvec{0}\). Then, the general solution of (41) takes the form

$$\begin{aligned} \varvec{c}_\oplus - \varvec{c} = \lambda \varvec{w} + \varvec{g}, \end{aligned}$$
(42)

where \(\lambda \in \mathbb {R}\), and \(\varvec{g}\) is a particular solution of (41), e.g., the one fulfilling \(\varvec{g}\cdot \varvec{w}=0\).

In order to constrain the values of \(\lambda \), we proceed as follows. Note that we can write (36) with \((i,j,k) = (1,2,3)\) as

$$\begin{aligned} \frac{a_3 c_\oplus \rho _2}{q_2} = \left( \varvec{c}_\oplus -\varvec{c}\right) \cdot \left( \hat{\varvec{\rho }}_1 +\alpha _{13}\hat{\varvec{q}}_1\right) + \frac{\kappa \mathcal {C}_{31}}{T_3} -\frac{\alpha _{13}}{q_1}\varvec{c}_\oplus \cdot \varvec{p}_{\text {obs},1}. \end{aligned}$$
(43)

Inserting the general solution (42) into (43) and (30) with \(i=2\), we obtain

$$\begin{aligned} {\rho }_2 = \frac{1}{b}(\lambda \varvec{w}+\varvec{g})\cdot \varvec{\gamma }_1 + f, \end{aligned}$$
(44)

where

$$\begin{aligned} b = \frac{a_1a_3c_\oplus }{q_2}, \quad f=\frac{q_2}{a_3c_\oplus }\left( \frac{\kappa \mathcal {C}_{31}}{T_3} -\frac{\alpha _{13}}{q_1}\varvec{c}_\oplus \cdot \varvec{p}_{\text {obs},1} \right) , \end{aligned}$$
(45)

and

$$\begin{aligned} \left( \varvec{c}_\oplus - \lambda \varvec{w} - \varvec{g}\right) \cdot {\hat{{\varvec{\rho }}}}_2\,\rho _2 = (\lambda \varvec{w} + \varvec{g})\cdot \varvec{q}_2 - \varvec{c}_\oplus \cdot \varvec{p}_{\text {obs},2}. \end{aligned}$$
(46)

Substituting the expression (44) of \(\rho _2\) in (46), we get

$$\begin{aligned} \left( \varvec{c}_\oplus - \lambda \varvec{w} - \varvec{g}\right) \cdot {\hat{{\varvec{\rho }}}}_2\left[ \frac{1}{b}(\lambda \varvec{w}+\varvec{g})\cdot \varvec{\gamma }_1 + f \right] = (\lambda \varvec{w} + \varvec{g})\cdot \varvec{q}_2 - \varvec{c}_\oplus \cdot \varvec{p}_{\text {obs},2}, \end{aligned}$$

which can be written as

$$\begin{aligned} \begin{aligned}&(\varvec{w}\cdot {\varvec{\gamma }}_1)(\varvec{w}\cdot \hat{\varvec{\rho }}_2)\lambda ^2 +\bigl [(\varvec{w}\cdot {\varvec{q}}_2)b - (\varvec{w}\cdot \varvec{\gamma }_1)(\varvec{c}_\oplus -\varvec{g})\cdot \hat{\varvec{\rho }}_2 + (\varvec{w}\cdot {\hat{\varvec{\rho }}}_2)(\varvec{g}\cdot \varvec{\gamma }_1 + b f)\bigr ]\lambda \\&\quad + b\left( \varvec{g}\cdot {\varvec{q}_2}-\varvec{c}_\oplus \cdot \varvec{p}_\mathrm{{obs},2}\right) - \left( \varvec{g}\cdot \varvec{\gamma }_1+ b f\right) (\varvec{c}_\oplus -\varvec{g})\cdot \hat{\varvec{\rho }}_2 = 0. \end{aligned} \end{aligned}$$
(47)

Equation (47) can be compared with Eq. (28). It is worth noting that in the topocentric formulation we do not have the solution \(\lambda = 0\) as in Mossotti’s original method, so that this formulation leads to a quadratic equation.

Remark 5

In the topocentric case, we could add a third linear equation to system (41) by choosing three different triplets of observations, out of the available four. However, we expect that also in this case the system is ill-conditioned, because the vectors \(\varvec{q}_1,\varvec{q}_2,\varvec{q}_3,\varvec{q}_4\) are almost coplanar.

4 Mossotti’s method for space debris

We can follow the same scheme introduced in Sect. 3 for the computation of the orbits of space debris, assuming that the Earth is spherical and rotates with uniform angular velocity. In this case, we use the rescaled time

$$\begin{aligned} \theta = \kappa _\oplus t, \end{aligned}$$

with \(\kappa _\oplus = \sqrt{Gm_\oplus }\). Here, the vector \(\varvec{q}\) represents the geocentric positions of the observer, \(\varvec{r}\) and \({\varvec{\rho }}= \varvec{r}- \varvec{q}\) give the geocentric and topocentric positions of the debris, and \(\varvec{c}={\varvec{r}}\times {\dot{{\varvec{r}}}}\) is its orbital angular momentum. We consider the orthogonal decomposition

$$\begin{aligned} \varvec{q}= \varvec{q}^\perp + \varvec{p}_\mathrm{obs}, \end{aligned}$$

with

$$\begin{aligned} \varvec{q}^\perp = ({\varvec{e}}_3\times \varvec{q})\times {\varvec{e}}_3, \quad \varvec{p}_\mathrm{obs}= (\varvec{q}\cdot {\varvec{e}}_3){\varvec{e}}_3, \end{aligned}$$

where \({\varvec{e}}_3\) is the unit vector of the Earth rotation axis, see Fig. 3. Moreover, we introduce the vector

$$\begin{aligned} \varvec{c}_\mathrm{obs}= \varvec{q}^\perp \times \frac{d}{dt}{\varvec{q}}^\perp = \varvec{q}^\perp \times \dot{\varvec{q}}, \end{aligned}$$

where the last equality holds because \(\varvec{p}_\mathrm{obs}\) is constant.

Fig. 3
figure 3

Geometry of observations of space debris

We can write a quadratic equation analogous to (47) simply by substituting the vectors \(\varvec{c}_\oplus \), \(\varvec{q}_\oplus \) with \(\varvec{c}_\mathrm{obs}\), \(\varvec{q}^\perp \), and the parameter \(p_\oplus \) with \(|\varvec{q}^\perp |\).

5 Numerical tests

In this section, we test the performance of Mossotti’s original geocentric method (see Sect. 2) and its topocentric version introduced in Sect. 3. In the following, we denote the former by M\(_{\mathrm{geo}}\) and the latter by M\(_{\mathrm{top}}\).

In Sects. 5.1, 5.2, we compare M\(_{\mathrm{geo}}\) with M\(_{\mathrm{top}}\) using simulated observations (right ascension and declination) computed for the site of the Pan-STARRS1 telescope, mount Haleakala, Hawaii, without taking into account observability conditions, i.e., the asteroids are not necessarily visible in the night sky. Moreover, we assume that the four observations given in input to M\(_{\mathrm{geo}}\) and M\(_{\mathrm{top}}\) are equally spaced in time. The time interval \(\Delta t\) between two consecutive observations is varied in the two intervals:

$$\begin{aligned} I_1=[15,200]\text { min,}\quad I_2=[0.25,100]\text { days.} \end{aligned}$$
(48)

The comparison is based on the computation of the angular momentum vector \(\mathbf{c}\) and the following related quantities: its magnitude \(c = |\mathbf{c}|\) and direction \(\hat{\mathbf{c}}=\mathbf{c}/c\), the longitude of the (ascending) node \(\Omega \), and the inclination i. We denote by \(x_t\) the true value of the quantity x, and by \(x_{\mathrm{geo}}\), \(x_{\mathrm{top}}\) the values of x computed by M\(_{\mathrm{geo}}\), M\(_{\mathrm{top}}\), respectively. Note that while M\(_{\mathrm{geo}}\) always produces one solution for \(\mathbf{c}\), the method M\(_{\mathrm{top}}\) can give two solutions. If this is the case, we select the one for which \(|\mathbf{c}_t-\mathbf{c}_{\mathrm{top}}|\) is smaller.

In Sect. 5.3, the method M\(_{\mathrm{top}}\) is compared to Gauss’s method for initial orbit determination. Synthetic data have been obtained that take into account the observability conditions and the expected real cadence of the observations from the Vera C. Rubin Observatory, which is currently under construction in Chile. With respect to the previous tests, we remark that for any set of four observations the time interval \(\Delta t\) is not constant.

5.1 Tests without astrometric errors

We consider simulated observations of the asteroid Vesta without astrometric error. The time \(\Delta t\) between two consecutive observations is varied in the two intervals \(I_1\), \(I_2\) specified in (48). Given \(\Delta t\), we select 10\(^5\) different initial epochs in a random way, and for each of them we generate four observations. For this purpose, we use the orbit of Vesta at the epoch 59200 MJD from the AstDyS-2 websiteFootnote 4 and propagate it to the desired epochs assuming Keplerian motion. The angular momentum vector defined by the Keplerian orbit is assumed to be the true solution (\(\mathbf{c}_t\)). Finally, we apply M\(_{\mathrm{geo}}\), M\(_{\mathrm{top}}\) to each set of four observations.

Let us first consider the case of short arcs of observation. For each \(\Delta t\) selected in \(I_1\), we compute the differences between the true values of the inclination (\(i_t\)), longitude of the node (\(\Omega _t\)), magnitude of the angular momentum vector (\(c_t\)) of Vesta, and the values obtained by either M\(_{\mathrm{geo}}\) or M\(_{\mathrm{top}}\). Some relevant statistical quantities related to these errors are shown in Fig. 4 as functions of \(\Delta t\). We note that the topocentric version of Mossotti’s method provides much better results than the original method. In particular, we observe that the performance of M\(_{\mathrm{top}}\) improves as \(\Delta t\) increases, and it stabilizes when the time interval is about 30 min. It is remarkable that for \(\Delta t\) larger than 25 min the error in the inclination is smaller than 0.01 degrees for the solutions that fall within the 1st and 3rd quartile and it is smaller than 0.1 degrees for the solutions that fall within the 5th and 95th percentile.

Fig. 4
figure 4

Statistics related to the errors in the inclination i, longitude of the node \(\Omega \), and magnitude of the angular momentum vector c, obtained by M\(_{\mathrm{geo}}\) (left) and M\(_{\mathrm{top}}\) (right) as functions of \(\Delta t\) in the case of observations of Vesta without astrometric error. In particular, we show the median (black line), the 1st and 3rd quartiles (blue and red lines), and the 5th and 95th percentiles (gray lines). The angles are in degrees and the \(\Delta t\) step is 1.5 s

Fig. 5
figure 5

Statistics related to the errors in the angular momentum vector \(\mathbf{c}\) and its direction \(\hat{\mathbf{c}}\), obtained by M\(_{\mathrm{geo}}\) and M\(_{\mathrm{top}}\) as functions of \(\Delta t\) in the case of observations of Vesta without astrometric error. In particular, we show the median (Q2), the 1st and 3rd quartiles (Q1, Q3), and the 95th percentile (P95). Here the \(\Delta t\) step is 15 min

We then allow \(\Delta t\) to take values in the interval \(I_2\), which corresponds to wider arcs of observation. For each \(\Delta t\) selected in \(I_2\), we compute the errors in the angular momentum vector and its direction. Statistical quantities related to these errors are shown in Fig. 5 as functions of \(\Delta t\). M\(_{\mathrm{top}}\) continues to show better results and a smoother behavior for values of \(\Delta t\) smaller than 30 days, even if the improvement over M\(_{\mathrm{geo}}\) is less pronounced with respect to that shown in Fig. 4. In such interval, the geocentric method is much more sensitive to \(\Delta t\) and large oscillations having a period of one day appear. This is due to not accounting for the topocentric position of the observer. We also notice that both methods exhibit an optimal performance for \(\Delta t\approx \) 3 weeks, and M\(_{\mathrm{top}}\) obtains in \(75\%\) of the solutions an error smaller than \(0.2\%\) and \(0.03\%\) in the vectors \(\mathbf{c}\) and \(\hat{\mathbf{c}}\), respectively.

If we consider time intervals longer than 30 days, the performance of the two methods is almost comparable, which is expected because the terms introduced in the topocentric version become smaller as \(\Delta t\) grows. The solutions with both methods deteriorate for \(\Delta t > 80\) days since they rely on Taylor’s expansions with respect to \(\Delta t\).

Table 1 Values of the inclination (i), longitude of the node (\(\Omega \)), magnitude of the angular momentum vector (c), obtained by M\(_{\mathrm{top}}\) and M\(_{\mathrm{geo}}\), with observations of Vesta not affected by astrometric error and with different time intervals between consecutive observations

In Table 1, we report the inclination, longitude of the node, and magnitude of the angular momentum vector obtained by M\(_{\mathrm{geo}}\) and M\(_{\mathrm{top}}\) from observations of Vesta without astrometric error, considering different values of \(\Delta t\) with the same epoch for the first observation. We also show for M\(_{\mathrm{geo}}\) with the label “2” the solution of Mossotti’s original method which is always discarded, i.e., the one for which the angular momenta of the asteroid and the Earth are equal. We observe that if \(\Delta t\) is large enough, the values of i and c of this spurious solution are close to the wrong solution from M\(_{\mathrm{top}}\) (also labeled by “2”). The same behavior is not observed for \(\Omega \) because the Earth orbital inclination is small and therefore small variations in i can cause large deviations in \(\Omega \). For \(\Delta t = 30\) min (0.02 days), the solution from M\(_{\mathrm{geo}}\) with label “1” is very close to the wrong solution from M\(_{\mathrm{top}}\): in this case, only M\(_{\mathrm{top}}\) gives values of i, \(\Omega \), c close to the true ones.

5.2 Tests with astrometric errors

The same numerical tests described in the previous section for the asteroid Vesta are carried out by introducing an astrometric error with zero mean and standard deviation (rms) of 0.1 arcsec in the values of right ascension and declination, which is typical of modern asteroid surveys like Pan-STARRS1. Neither M\(_{\mathrm{geo}}\) nor M\(_{\mathrm{top}}\) gives reliable results for \(\Delta t\) between 15 and 200 min. Indeed, determining a preliminary orbit from a single short arc is a challenging task, sometimes impossible without considering infinitely many solutions (Milani et al. 2005, 2004). However, the computation of a preliminary orbit can be performed by linking together two or more short arcs (e.g., Gronchi et al. 2015, 2017). On the other hand, both methods yield satisfactory results for time intervals between two consecutive observations larger than 8 h with only a slight degradation of their performance with the introduction of astrometric error (compare Figs. 5, 6). As in the case without astrometric error, M\(_{\mathrm{top}}\) is better than M\(_{\mathrm{geo}}\) for any considered \(\Delta t\). A smoother behavior of M\(_{\mathrm{top}}\) is observed for \(\Delta t\) smaller than 30 days. Both methods perform best for \(\Delta t \approx \) 3 weeks.

Fig. 6
figure 6

The same as Fig. 5 but the observations of Vesta are affected by astrometric error with zero mean and rms of 0.1 arcsec. Here, the \(\Delta t\) step is 15 min

In our third test, we use M\(_{\mathrm{geo}}\) and M\(_{\mathrm{top}}\) with synthetic observations of the 546077 numbered asteroids known to the date of October 19, 2020. Their orbital elements at the epoch 59200 MJD (from AstDyS-2) provide the true orbit and are used to simulate observations from the Pan-STARRS1 telescope site. The epoch of the first observation is obtained from 59200 MJD considering aberration correction; then, the subsequent three observations are simulated by Keplerian propagation. The time \(\Delta t\) is varied in the interval \(I_2\) given in (48), and the standard deviation of the astrometric error spans from 0 to 1 arcsec. Figures 78 show that for time intervals between 20 and 40 days both M\(_{\mathrm{geo}}\) and M\(_{\mathrm{top}}\) give good results. A closer look at this range of values of \(\Delta t\) for M\(_{\mathrm{geo}}\) reveals the same oscillations displayed in Figs. 5 and 6. On the other hand, M\(_{\mathrm{top}}\) smooths out such oscillations. The best performance of both M\(_{\mathrm{geo}}\) and M\(_{\mathrm{top}}\) is reached again for \(\Delta t\approx \) 3 weeks.

Fig. 7
figure 7

Performance of Mossotti’s original method (left) and its topocentric version (right) with observations of 546077 numbered asteroids. For each asteroid, we compute the difference between the true angular momentum vector \(\mathbf{c}_t\) at the epoch 59200 MJD, and the same quantity \(\mathbf{c}\) obtained by either M\(_{\mathrm{geo}}\) or M\(_{\mathrm{top}}\) on a uniform grid of values of \(\Delta t\) and rms of the astrometric error in the intervals \([0.25,\,100]\) days and \([0,\,1]\) arcsec. Colored representations of the median and the 95th percentile related to the error \(|\mathbf{c}_t-\mathbf{c}|/c_t\) are displayed. The step of \(\Delta t\) is 6 h, and the step of rms is 0.002 arcsec

Fig. 8
figure 8

The same as Fig. 7 for the error in the direction of the angular momentum vector

5.3 Tests on synthetic survey data without astrometric errors

Our final test introduces more realism into the simulations in an attempt to assess the performance of the topocentric version of Mossotti’s method (M\(_{\mathrm{top}}\)) on synthetic data that uses a realistic cadence and accounts for actual observability of the asteroids. With the motivation that Mossotti’s initial orbit determination method could be applied to linking observations of unknown asteroids in contemporary and future asteroid surveys, we applied M\(_{\mathrm{top}}\) to synthetic observations from the Vera Rubin Observatory’s (VRO) Legacy Survey of Space and Time (LSST) (Ivezić et al. 2019). They have developed a high-fidelity survey scheduler that will be employed in final operations but is currently being used to simulate and optimize the survey strategy (Connolly et al. 2014; Delgado and Reuter 2016; Naghib et al. 2019). We used a single survey simulation for one month of surveying that did not include any astrometric error. Then, we extracted the first four synthetic detections of all the detected numbered NEOs, Trojans, Centaurs, and TNOs, but only a small subset of the detected main belt objects so that they would not dominate our results. The epochs of observation were all within about 30 days, and the astrometric positions were generated with a full n-body integration. This process produced a set of four detections of 1535 objects distributed throughout the solar system. We then processed all the detections with both M\(_{\mathrm{top}}\) and our implementation of Gauss’s method, by limiting our search to bounded orbits only.

In our sample, Gauss’s method was able to produce orbits for 1493 objects (i.e., \(\sim 97\)%) and M\(_{\mathrm{top}}\) provided solutions for 1395 objects (i.e., \(\sim 91\)%). However, we had 59 occurrences of a negative discriminant of Eq. (47) with M\(_{\mathrm{top}}\), and for some of them, we were able to recover an acceptable orbit by setting the discriminant equal to zero. Doing so increases the number of solutions for M\(_{\mathrm{top}}\) to 1454 (i.e., \(\sim 95\)%).

While Gauss’s method can yield three different solutions for the same set of observations and M\(_{\mathrm{top}}\) can yield two, on this set of data they had multiple solutions for about 50% and 45% of the objects, respectively. Gauss’s method did not produce any orbit for 42 objects and M\(_{\mathrm{top}}\) for 81. Moreover, both of them failed in 30 of these cases. In 12 cases, M\(_{\mathrm{top}}\) was able to obtain at least one orbit, while Gauss’s method was not, and for some cases it found an acceptable orbit.

The primary benefit of M\(_{\mathrm{top}}\) is that in our limited testing it appears to be about 6 times faster than Gauss’s method. We repeated the orbit computation 1000 times for each of the 1535 objects using an Intel Xeon processor, with base clock 3.30 GHz: Gauss’s algorithm took \(\sim 77\) s, while M\(_{\mathrm{top}}\) \(\sim 13\) s.

Gauss’s technique provides better solutions for objects throughout the solar system (Fig. 9 and Table 2). When comparing the derived orbital elements with their actual values, we used the derived orbit solution that had the lowest 5-element D-criterionFootnote 5 relative to the actual orbit (Drummond 1981). A simple visual comparison of the results suggests that Gauss’s method is more likely to produce good orbital elements and less likely to yield wildly different values. Quantitative orbital element comparisons confirm this impression (Table 2).

Fig. 9
figure 9

All the panels present the derived orbital parameter on the y-axis versus the actual parameter value on the x-axis using error-free data from a VRO–LSST simulation. The top panels are for Gauss’s method, and the bottom panels use the topocentric version of Mossotti’s method. From left to right, the panels compare \(\log _{10}(a/[\mathrm{au}])\), where a is the semi-major axis, eccentricity, and inclination. The red reference line in each figure has a slope of 1

Table 2 Mean and rms of the difference between the actual and derived orbital elements for both Gauss’s method and the topocentric version of Mossotti’s method

6 Conclusions

In this paper, we have revisited Mossotti’s orbit determination method working with four geocentric observations of a celestial body, and extended it to the case of topocentric observations. While Mossotti’s method yields linear equations for the components of the angular momentum vector, the topocentric version leads to a quadratic equation. Numerical simulations with synthetic observations both without and with astrometric error show that the topocentric method improves the original one. Considering all the numbered asteroids, and generating for each of them four observations equally spaced in time, we find that both these methods show an optimal behavior for a time separation \(\Delta t\) between two consecutive observations of about 3 weeks. Finally, we compare the new method with Gauss’s method using synthetic observations without astrometrical error that reproduce the expected scheduling of the Vera Rubin Observatory’s (VRO) Legacy Survey of Space and Time (LSST), characterized by an average \(\Delta t\) of about 4 days. Gauss’s method provides good orbits for a larger number of objects than the topocentric version of Mossotti’s method, which, on the other hand, is faster.