1 Problem statement

Consider the following project: radial velocities of a number of faint binary star targets are to be obtained by queue scheduling of observing time. The scheduling will be such that measurements will be obtained at random phases. (For example, the periods may be much shorter than times between observations). How many visits N per target are required to ensure good phase coverage? It is assumed that the period P and the orbital eccentricity e of each of the stars is known from photometric measurements.

In general, the radial velocity is described by the set of equations

$$\begin{aligned} M= & {} \frac{2\pi }{P}(t-t_0)\nonumber \\ M= & {} E-e\sin ~E \nonumber \\ \theta= & {} 2 \tan ^{-1} \left[ \sqrt{\frac{1+e}{1-e}} \tan (E/2) \right] \nonumber \\ v_r= & {} \gamma +K [\cos (\theta +\omega )+e\cos \omega ] \end{aligned}$$
(1)

where \(\gamma \) is the systemic velocity, K the semi-amplitude, \(\theta =\theta (t)\) the true anomaly, \(\omega \) the argument of periastron, E the eccentric anomaly, M the mean anomaly, and \(t_0\) the time of periastron passage [e.g. Tatum [13] or https://www.astro.uvic.ca/~tatum/celmechs.html]. This serves as a basis for the material below.

It is well known that the eccentricity of binary orbits can be determined from light curves alone (e.g. [12]). Approximate eccentricities of exoplanetary orbits can also be calculated from photometry of transits. If more than one planet orbits a star, the transit timings will not be perfectly periodic. The transit timing variations can be used to infer, amongst other quantities, the eccentricities of the planetary orbits (e.g. [1, 4, 6]). The effects of eccentricity on the shape of transit light curves have also been exploited to derive a relationship between the eccentricity and the mean density of the host star (e.g. [2, 7], and references therein). Weak limits on the stellar density lead to sharp limits on the eccentricity of the planetary orbit.

The conclusion is that information (periods, eccentricities) gained from transit observations can be also used to plan radial velocity observations of exoplanet systems. Given that the velocities involved are much smaller than those of binary stars, the theory below pertains to the minimum number of epochs of observation, rather than the total number of velocity measurements.

2 Adequate phase coverage

The first task is to define what exactly is meant by “good" (or adequate) phase coverage. A sensible approach is to base the answer on controlling the estimation errors on the parameters of interest (systemic, velocity, velocity semi-amplitude and argument of periastron, if \(e=0\).)

A quantitative answer to the question can be given by noting that for circular orbits, the observed radial velocity is

$$v_r(t)=\gamma +K \cos (2\pi t/P +\omega ^\prime )+\varepsilon (t)$$

where \(\omega ^\prime \) is the phase at time \(t=0\) and \(\varepsilon \) represents measurement error. If the velocities are phased with respect to P,

$$\begin{aligned} v_r(t)= & {} \gamma +K \cos [2\pi \psi (t)+\phi ]+\varepsilon (t)\nonumber \\= & {} \gamma +A\cos [2\pi \psi (t)]+B\sin [2\pi \psi (t)]+\varepsilon (t) \end{aligned}$$
(2)

with \(0 \le \psi < 1\). Equation (2) is in standard linear regression form; it follows immediately that the covariance matrix C of the estimated values of \(\gamma \), A and B is

$$\begin{aligned} C = \sigma _\varepsilon ^2 (X^\prime X)^{-1} \equiv \sigma _\varepsilon ^2 W \end{aligned}$$
(3)

(e.g. [8]). In this case the design matrix X is

$$\begin{aligned} X=\begin{bmatrix}1&{} \cos 2\pi \psi _1 &{} \sin 2 \pi \psi _1 \\ \\ 1&{} \cos 2\pi \psi _2 &{} \sin 2 \pi \psi _2 \\ \\ \vdots &{} \vdots \\ \\ 1&{} \cos 2\pi \psi _N &{} \sin 2 \pi \psi _N \end{bmatrix}. \end{aligned}$$
(4)

The considerable advantage of the linear form in (2) is that X, and hence W, does not contain any unknowns: scale factor \(\sigma _\varepsilon \) aside, C is fully characterised by W.

Fig. 1
figure 1

The pattern of phase spacings used to derive the results in Fig. 2: a regular grid spacing covering the interval [0, 1), with a single large gap

The statistical field of optimal design is concerned with controlling (as much as possible) the entries in X in order to minimise the variances appearing in the matrix C. To this end, various measures based on the matrix W has been suggested. Here, we will refer to only two popular choices: A-optimality, which entails minimising the trace of W, and D-optimality, which minimises the determinant of W (e.g. [10]).

Figures 1 and 2 illustrate application of the two optimality criteria to the problem at hand. Proceeding from a regular grid of phase values \(\psi \) spanning the interval [0, 1), gaps are introduced, as plotted in Fig. 1. Gaps are increased in length, and for each “design" the matrix W is calculated. The traces and determinants of W are plotted in Fig. 2, normalised with respect to the values for zero gap length. As expected, trace(W) and |W| increase with increasing gap lengths, demonstrating the sub-optimality with respect to the complete set of phases \(\psi \).

Fig. 2
figure 2

The effect of large gaps in a regularly spaced coverage of the phase interval [0, 1), as in Fig. 1. Top panel: the A-optimal criterion. Bottom panel: D-optimal criterion. Dots and circles respectively show results for full observation grids of 20 and 60 points

The interpretation of the A-optimality criterion is more transparent: it is essentially the sum of the three variances (of the estimated values of \(\gamma \), A and B). For gap lengths longer than 0.45, trace\((W)>5\), i.e. a five-fold increase in the variance. The precise choice will obviously be dependent on the context. Here, illustrative numerical values will be given assuming maximum acceptable phase gaps of 0.4.

3 Probability of large gaps

It is convenient to think of phase as being defined on a circle with unit circumference, so that \(\psi \) and \(\psi +k\), with k being any integer, refer to the same radial velocity phases. The phase of each radial velocity measurement corresponds uniquely to a point placed randomly on the unit circle. The definition of good phase coverage is then equivalent to placing a limit \(\ell \) on the largest arc length between any two points on the unit circle. This problem appears to have been first addressed by Stevens [11]. We will use the result (e.g. [5])

$$\begin{aligned} F(\ell )=\sum _{j=0}^L (-1)^j {N \atopwithdelims ()j} (1-j\ell )^{N-1} \end{aligned}$$
(5)

where F is the cumulative distribution function (CDF), and \(L=\lfloor 1/\ell \rfloor \) is the greatest integer smaller than, or equal to, \(1/\ell \).

The probability density function (PDF) corresponding to (5) is

$$\begin{aligned} f(\ell )=N(N-1)\sum _{j=1}^L (-1)^{j-1} {N-1 \atopwithdelims ()j-1} (1-j\ell )^{N-2} \; . \end{aligned}$$
(6)

A few examples are plotted in Fig. 3. As intuition would suggest, PDFs are more compact and concentrated at smaller \(\ell \) as N increases. Note that gaps of length 0.4 are unlikely by the time \(N=15\), and seem virtually impossible at \(N=25\).

Fig. 3
figure 3

Probability density functions of \(\ell \) [Equation (6)]. From right to left, \(N=5, 10, 15, 25\)

Two notes: first, although the shapes of the PDFs in Fig. 3 resemble those of skew normal distributions, the standardised skewness coefficient \(\gamma _1<1\) for the latter, while the skewness associated with the PDF f slightly exceeds unity for \(N \ge 10\). Second, Holst [5], amongst others, discussed the asymptotic distribution of the largest arc length. Although this has a conveniently simple form, numerical experiments show that it is not a good approximation to (5) for the relatively small values of N considered here.

For \(0.34< \ell < 0.5\), \(L=2\) and

$$F(\ell )=1-(N-1)(1-\ell )^{N-1}+\frac{1}{2}N(N-1)(1-2\ell )^{N-1} \; .$$

For \(\ell =0.4\), \(F(\ell )=0.989\) for \(N=15\), i.e. in only 1% of cases will phase gaps larger than 0.4 be encountered if \(v_r\) is measured 15 times. A 99.9% certainty that \(\ell <0.4\) requires \(N=21\).

4 Several stars

The treatment up to this point has focussed on single targets. Consider a group of M stars, each with N measurements of \(v_r\) taken at random phases. Let p be the probability of an unacceptably large phase gap for a single star. Then the probability that m of the M stars will have poorly determined parameters has a binomial distribution:

$$\begin{aligned} P(m=x)={M \atopwithdelims ()x} p^x (1-p)^{M-x} \end{aligned}$$
(7)

Table 1 contains some illustrative results: with \(N=21\) random measurements per star there is an reasonable (95%) chance that all M datasets will be acceptable (at least for \(M \le 50\)).

Table 1 Probabilities of finding 0, 1 or 2 stars with inadequate phase coverage (largest gap \(\ge 0.4\))

5 Non-circular orbits

The problem is somewhat more complicated if the eccentricity \(e \ne 0\). Note first though that the last of equations (1) can still be written as a linear regression, albeit in terms of \(\theta \), rather than t or \(\psi \):

$$v_r[\theta (t)]=\gamma ^\prime +K\cos [\theta (t)+\omega ]+\varepsilon (t)$$

with \(\gamma ^\prime =\gamma +e\cos (\omega )\). This means that the discussion in Section 2 applies, with a limit on the maximum gap in the phased values of true anomaly to be specified. We assume the same numerical value, 0.4, as in Section 2.

The main problem introduced by the non-zero eccentricity is that, contrasting with the case \(e=0\), the random time points of observation do not map into a uniform distribution of phase values. Instead, the distribution is given by the Jacobian of the transformation from \(\psi \) to \(\theta \) (e.g. [9]):

$$\begin{aligned} f_\theta (\theta )= & {} \left| \frac{d\psi }{d\theta } \right| \nonumber \\= & {} \left| \frac{d\psi }{dE}\frac{dE}{d\theta } \right| \nonumber \\= & {} \frac{1}{2\pi }(1-e\cos ~E)\frac{d}{dt} \left\{ 2 \tan ^{-1} \left[ \alpha \tan (\theta /2) \right] \right\} \nonumber \\= & {} \frac{1}{2\pi }\left\{ 1-e\cos \left[ 2\tan ^{-1} (\alpha \tan (\theta /2)\right] \right\} \frac{\alpha }{\cos ^2(\theta /2)+ \alpha ^2 \sin ^2 (\theta /2)}\nonumber \end{aligned}$$

where \(\alpha \equiv \sqrt{(1-e)/1+e)}\) and \(-\pi \le \theta < \pi \). It is slightly more convenient to work in terms of \(0 \le \phi =(\theta +2\pi )/2\pi <1\), in which case

$$\begin{aligned} f_\phi (\phi )=\left\{ 1-e\cos \left[ 2\tan ^{-1} (\alpha \cot (\pi \phi )\right] \right\} \frac{\alpha }{\sin ^2(\pi \phi )+ \alpha ^2 \cos ^2 (\pi \phi )} \; . \end{aligned}$$
(8)

A few examples of the largest gap PDFs resulting from (8) are plotted in Fig. 4. In order to produce these, many datasets of size N were simulated from (8), and the largest gap in the ordered phases \(\phi \) for each determined. PDFs were then estimated by a kernel density procedure applied to the collection of largest gaps. Unsurprisingly, considerably larger N, as compared with the \(e=0\) case, are required to thoroughly cover the full phase range (compare Fig. 3). This point is further explored in Table 2 which gives, for a variety of eccentricities, values of N needed to guarantee acceptably small phase gaps.

Fig. 4
figure 4

Simulated probability density functions of \(\ell \) for elliptical orbits. Top panel: \(e=0.2\); from right to left \(N=10, 20, 30, 50\). Bottom panel: \(e=0.5\); from right to left \(N=15, 30, 45, 60\)

Table 2 The number of observation N required to ensure a probability p of a maximum gap \(\ell =0.4\) in the true anomaly \(\theta \), for different values of the eccentricity e

6 An illustation

The central point of this paper is illustrated by subsampling radial velocity measurements of the planet-hosting star GJ 3988 [3]. The \(P=6.944\) d orbit of GJ 3988b is circular, so that the discussion in Section 3 applies.

Gorrini et al. [3] obtained 164 velocity measurements, of which three values deviating more than 15 ms\(^{-1}\) from the mean were discarded. Small subsamples were randomly drawn from the remaining 161 velocities, such that the times of observation were at least \(P/2=3.472\) d apart. For each subsample the velocity semi-amplitude K was calculated, and the maximum phase gap \(\ell \) noted. Of interest is the dependence on \(\ell \) of the uncertainty \(\sigma _K\) of the semi-amplitude. This was found be drawing many (\(10^5\)) subsamples for a given value of N, binning together those K with similar values of \(\ell \), and calculating the bin standard deviations. The results can be seen in Fig. 5, which shows the expected monotonic increases of \(\sigma _K\) with \(\ell \).

Fig. 5
figure 5

Standard errors \(\sigma _K\) on the velocity semi-amplitude estimated from subsamples of measurements by Gorrini et al. [3]. Squares, circles and diamonds respectively denote subsamples of size \(N=16, 18\) and 20

7 Concluding remarks

In order to keep the discussion above as simple and general as possible, fitting radial velocity models to observations has been treated as a linear least squares problem. In practice, this would not lead to direct estimation of quantities of interest such as K and \(\omega \). In order to estimate these, and other parameters explicitly, nonlinear regression is required, and error estimates would depend on the parameters of each individual system.

In general, Equations (7) and (5) should provide a good guideline for the minimum number of visits to each target. Adhering to those criteria will help guard against estimation errors induced by insufficient phase coverage of binary/exoplanet orbits. In implementation, Equation (7) can first be used for the specified number of targets M to place a limit on the probability p. The number of visits N can then be solved for from Equation (5). Of course, objects with substantially eccentric orbits will require more visits, and will have to be treated on an individual basis. Furthermore, in the case of exoplanets, random errors (corresponding to the scale factor \(\sigma _\varepsilon \)) will play a more important role, necessitating repeated measurements at each target visit.