1 Introduction

In this paper, we develop and study efficient schemes for covering and quantization in high-dimensional cubes. In particular, we will demonstrate that the proposed schemes are much superior to the so-called low-discrepancy sequences. The paper starts with introducing the main notation, then we formulate the main problem of covering a d-dimensional cube by n Euclidean balls. This is followed by a discussion on the main principles we have adopted for construction of our algorithms. Then we briefly formulate problems of covering a cube by smaller cubes (which are balls in the \(L_{\infty }\)-norm) and the problem of quantization. Both problems have many similarities with the main problem of covering a cube by n balls. At the end of this section, we describe the structure of the remaining sections of the paper and summarize our main findings.

1.1 Main Notation

  • \(\mathbb {R}^{d}\): d-dimensional space; vectors in \(\mathbb {R}^{d}\) are row-vectors;

  • ∥⋅∥ and \(\|\cdot \|_{\infty }\): Euclidean and \(L_{\infty }\)-norms in \(\mathbb {R}^{d}\);

  • \({{\mathscr{B}}}_{d}(Z,{ r })= \{ Y \in \mathbb {R}^{d}: \| Y-Z \| \leq r\}\): d-dimensional ball of radius r centred at \(Z \in \mathbb {R}^{d}\);

  • \({{\mathscr{B}}}_{d}({ r })= {{\mathscr{B}}}_{d}(0,{ r })=\{ Y \in \mathbb {R}^{d}: \| Y \| \leq { r }\}\);

  • \({\mathcal {S}}_{d}(Z,r)= \{ Y \in \mathbb {R}^{d}: \| Y-Z \| = { r } \}\): d-dimensional sphere of radius r centred at \(Z \in \mathbb {R}^{d}\);

  • \({\mathcal {C}}_{d}(Z,\delta )=\{ Y \in \mathbb {R}^{d}: \| Y -Z\|_{\infty } \leq { \delta } \}\): d-dimensional cube of side length 2δ centred at Z (it is also the d-dimensional ball in the \(L_{\infty }\)-norm with radius δ and centre Z);

  • \({\mathcal {C}}_{d}(\delta )=[-\delta ,\delta ]^{d}={\mathcal {C}}_{d}(0,\delta )\);

  • \({\mathcal {C}}_{d}=[-1,1]^{d}={\mathcal {C}}_{d}(1) \).

1.2 Main Problem of Interest

The main problem discussed in the paper is the following problem of covering a cube by n balls. Let \({\mathcal {C}}_{d}=[-1,1]^{d}\) be a d-dimensional cube, Z1, … , Zn be some points in \(\mathbb {R}^{d}\) and \({{\mathscr{B}}}_{d}(Z_{j},r)\) be the corresponding balls of radius r centred at Zj(j = 1, … , n). The dimension d, the number of balls n and their radius r could be arbitrary.

We are interested in the problem of choosing the locations of the centres of the balls Z1, … , Zn so that the union of the balls \(\cup _{j}{{\mathscr{B}}}_{d}(Z_{j},r)\) covers the largest possible proportion of the cube \({\mathcal {C}}_{d}\). That is, we are interested in choosing a design (a collection of points) \(\mathbb {Z}_{n}=\{Z_{1}, \ldots , Z_{n}\}\) so that

$$ C_{d}(\mathbb{Z}_{n},r):= \text{vol} ({\mathcal{C}}_{d} \cap {\mathcal{B}}_{d}(\mathbb{Z}_{n},r))/2^{d} $$
(1)

is as large as possible (given n, r and the freedom we are able to use in choosing Z1, … , Zn). Here

$$ {\mathcal{B}}_{d}(\mathbb{Z}_{n},r)= \bigcup_{j=1}^{n} {\mathcal{B}}_{d}(Z_j,r) $$
(2)

and \(C_{d}(\mathbb {Z}_{n},r)\) is the proportion of the cube \({\mathcal {C}}_{d}\) covered by the balls \({{\mathscr{B}}}_{d}(Z_{j},r)(j=1, \ldots , n)\). If points \(Z_{j} \in \mathbb {Z}_{n}\) are random then by \(C_{d}(\mathbb {Z}_{n},r)\) we will mean \(\mathbb {E}_{\mathbb {Z}_{n}}C_{d}(\mathbb {Z}_{n},r)\) but we are not going to stress this in notation unless it is important.

For a design \(\mathbb {Z}_{n}\), its covering radius is defined by CR\((\mathbb {Z}_{n})= \max \limits _{X \in {\mathcal {C}}_{d}} \min \limits _{Z_{j} \in \mathbb {Z}_{n}}\|X-Z_{j}\|\). In computer experiments, covering radius is called minimax-distance criterion (see [5] and [13]); in the theory of low-discrepancy sequences, covering radius is called dispersion (see [8, Ch. 6]).

The problem of optimal covering of a cube by n balls has very high importance for the theory of global optimization and many branches of numerical mathematics. In particular, the n-point designs \(\mathbb {Z}_{n}\) with smallest CR provide the following: (a) the n-point min-max optimal quadratures (see [14, Ch.3,Th.1.1]), (b) min-max n-point global optimization methods in the set of all adaptive n-point optimization strategies (see [14, Ch.4,Th.2.1]), and (c) worst-case n-point multi-objective global optimization methods in the set of all adaptive n-point algorithms (see [20]). In all three cases, the class of (objective) functions is the class of Liptshitz functions, where the Liptshitz constant may be unknown. The results (a) and (b) are the celebrated results of A.G.Sukharev obtained in the late nineteen-sixties and (c) is a recent result of A. žilinskas.

If d is not small (say, d > 5) then computation of the covering radius CR\((\mathbb {Z}_{n}) \) for any non-trivial design \(\mathbb {Z}_{n}\) is a very difficult computational problem. This explains why the problem of construction of optimal n-point designs with smallest covering radius is notoriously difficult (see, for example, recent surveys [16, 17]).

If \(r= \textit {CR}(\mathbb {Z}_{n})\), then \(C_{d}(\mathbb {Z}_{n},r)\) defined in (1) is equal to 1, and the whole cube \({\mathcal {C}}_{d}\) gets covered by the balls. However, we are only interested in reaching the values like 0.9, when a large part of the ball is covered. There are two main reasons why we are not interested in reaching the value \(C_{d}(\mathbb {Z}_{n},r)= 1\): (a) practical impossibility of making a numerical checking of the full coverage, if d is large enough, and (b) our approximations lose accuracy when \(C_{d}(\mathbb {Z}_{n},r)\) closely approaches 1.

If, for a given γ ∈ [0, 1), we have \(C_{d}(\mathbb {Z}_{n},r)\geq 1-\gamma \), then the corresponding coverage of \({\mathcal {C}}_{d}\) will be called (1 − γ)-coverage; the corresponding value of r can be called (1 − γ)-covering radius. If γ = 0 then the (1 − γ)-coverage becomes the full coverage and 1-covering radius of \(\mathbb {Z}_{n}\) becomes \(C_{d}(\mathbb {Z}_{n},r)\). Of course, for any \(\mathbb {Z}_{n}=\{Z_{1}, \ldots , Z_{n}\}\) we can reach \(C_{d}(\mathbb {Z}_{n},r)= 1\) by means of increasing r. Likewise, for any given r we can reach \(C_{d}(\mathbb {Z}_{n},r)= 1\) by sending \(n \to \infty \). However, we are not interested in very large values of n and try to get the coverage of the most part of the cube \({\mathcal {C}}_{d}\) with the radius r as small as possible. We will keep in mind the following typical values of d and n: d = 10,20,50; n = 64,128,512,1024. Correspondingly, we will illustrate our results in such scenarios.

1.3 Two Contradictory Criteria and a Compromise

In choosing \(\mathbb {Z}_{n}=\{Z_{1}, \ldots , Z_{n}\}\), the following two main criteria must be followed:

  • (i) the volumes of intersections of the cube \({\mathcal {C}}_{d}\) and each individual ball \({{\mathscr{B}}}_{d}(Z_{j},r)\) are not very small;

  • (ii) the volumes of intersections \({{\mathscr{B}}}_{d}(Z_{j},r) \cap {{\mathscr{B}}}_{d}(Z_{i},r)\) are small for all ij(i,j = 1, … , n).

These two criteria do not agree with each other. Indeed, as shown in Section 2 (see formulas (11)–(14)), the volume of intersection of the ball \({{\mathscr{B}}}_{d}(Z,r)\) and the cube \({\mathcal {C}}_{d}\) is approximately inversely proportional to ∥Z∥ and hence criterion (i) favours Zj with small norms. However, if at least some of the points Zj get close to 0, then the distance between these points gets small and, in view of the formulas of Section A1.7, the volumes of intersections \({{\mathscr{B}}}_{d}(Z_{j},r) \cap {{\mathscr{B}}}_{d}(Z_{i},r)\) get large.

This yields that the above two criteria require a compromise in the rule of choosing \(\mathbb {Z}_{n}=\{Z_{1}, \ldots , Z_{n}\}\) as the points Zj should not be too far from 0 but at the same time, not too close. In particular, and this is clearly demonstrated in many examples throughout the paper, the so-called uniformly distributed sequences of points in \({\mathcal {C}}_{d}\), including ‘low-discrepancy sequences’ in \({\mathcal {C}}_{d}\), provide poor covering schemes. This is in a sharp contrast with the asymptotic case \(n \to \infty \) (and hence r → 0), when one of the recommendations (see [2, p. 84]) is to choose Zj’s from a uniformly distributed sequence of points from a set which is slightly larger than \({\mathcal {C}}_{d}\); this is to facilitate covering of the boundary of \({\mathcal {C}}_{d}\), as it is much easier to cover the interior of the cube \({\mathcal {C}}_{d}\) than its boundary.

In our considerations, n is not very large and hence the radius of balls r cannot be small. One of our recommendations for choosing \(\mathbb {Z}_{n}=\{Z_{1}, \ldots , Z_{n}\}\) is to choose Zj’s at random in a cube \({\mathcal {C}}_{d}(\delta )=[-\delta ,\delta ]^{d}\) (with 0 < δ ≤ 1) with components distributed according to a suitable beta-distribution. The optimal value of δ is always smaller than 1 and depends on d and n. If d is small or n is astronomically large, then the optimal value of δ could be close to 1 but in most interesting instances this value is significantly smaller than 1. This implies that the choice δ = 1 (for example, if Zj’s form a uniformly distributed sequence of points in the whole cube \({\mathcal {C}}_{d}\)) often leads to very poor covering schemes, especially when the dimension d is large (see Tables 12 and 3 in discussed in Section 3). More generally, we show that for construction of efficient designs \(\mathbb {Z}_{n}=\{Z_{1}, \ldots , Z_{n}\}\), either deterministic or randomized, we have to restrict the norms of the design points Zj. We will call this principle ‘δ-effect’.

Table 1 Values of r and δ (in parentheses) to achieve 0.9 coverage for d = 10
Table 2 Values of r and δ (in parentheses) to achieve 0.9 coverage for d = 20
Table 3 Values of r and δ (in parentheses) to achieve 0.9 coverage for d = 50

1.4 Covering a Cube by Smaller Cubes and Quantization

In Section 4 we consider the problem of (1 − γ)-coverage of the cube \({\mathcal {C}}_{d}=[-1,1]^{d}\) by smaller cubes (which are \(L_{\infty }\)-balls). The problem of 1-covering of cube by cubes has attracted a reasonable attention in mathematical literature (see, e.g., [3, 6]). The problem of (1 − γ)-coverage of a cube by cubes happened to be simpler than the main problem of (1 − γ)-coverage of a cube by Euclidean balls and we have managed to derive closed-form expressions for (a) the volume of intersection of two cubes, and (b) (1 − γ) coverage, the probability of covering a random point in \({\mathcal {C}}_{d}\) by n cubes \({\mathcal {C}}_{d}(Z_{i},r)\) for a wide choice of randomized schemes of choosing designs \(\mathbb {Z}_{n}=\{Z_{1}, \ldots , Z_{n}\}\). The results of Section 4 show that the δ-effect holds for the problem of coverage of the cube by smaller cubes in the same degree as for the main problem of Section 3 of covering with balls.

Section 5 is devoted to the following problem of quantization also known as the problem of facility location. Let X = (x1, … , xd) be uniform on \({\mathcal {C}}_{d}=[-1,1]^{d}\) and \(\mathbb {Z}_{n}=\{Z_{1}, \ldots , Z_{n}\}\) be an n-point design. The mean square quantization error is \(\theta _{n}=\theta (\mathbb {Z}_{n}):=\mathbb {E}_{X}\underset {i=1, \ldots , n}{\min \limits } \|X-Z_{i}\|^{2}\). In the case where Z1, … , Zn are i.i.d. uniform on \({\mathcal {C}}_{d}(\delta )\), we will derive a simple approximation for the expected value of \(\theta (\mathbb {Z}_{n})\) and clearly demonstrate the δ-effect. Moreover, we will notice a strong similarity between efficient quantization designs and efficient designs constructed in Section 3.

1.5 Structure of the Paper and Main Results

In Section 2 we derive accurate approximations for the volume of intersection of an arbitrary d-dimensional cube with an arbitrary d-dimensional ball. These formulas will be heavily used in Section 3, which is the main section of the paper dealing with the problem of (1 − γ)-coverage of a cube by n balls. In Section 4 we extend some considerations of Section 3 to the problem of (1 − γ)-coverage of the cube \({\mathcal {C}}_{d}\) by smaller cubes. In Section 5 we argue that there is a strong similarity between efficient quantization designs and efficient designs of Section 3. In Appendix 1, we briefly mention several facts, used in the main part of the paper, related to high-dimensional cubes and balls. In Appendix 2, we prove two simple but very important lemmas about distribution and moments of certain random variables.

Our main contributions in this paper are:

  • an accurate approximation (18) for the volume of intersection of an arbitrary d-dimensional cube with an arbitrary d-dimensional ball;

  • an accurate approximation (26) for the expected volume of intersection of the cube \({\mathcal {C}}_{d}\) with n balls with uniform random centres \(Z_{j} \in {\mathcal {C}}_{d}(\delta )\);

  • closed-form expression of Section 4.2 for the expected volume of intersection the cube \({\mathcal {C}}_{d}\) with n cubes with uniform random centres \(Z_{j} \in {\mathcal {C}}_{d}(\delta )\);

  • construction of efficient schemes of quantization and (1 − γ)-coverage of the cube \({\mathcal {C}}_{d}\) by n balls;

  • large-scale numerical study.

We are preparing an accompanying paper [9] in which we will further explore the topics of Sections 35 and also consider the problems of quantization and (1 − γ)-coverage in the whole space \(\mathbb {R}^{d}\) and the problem of (1 − γ)-coverage of simplices.

2 Volume of Intersection of a Cube and a Ball

2.1 The Main Quantity of Interest

Consider the following problem. Let us take the cube \({\mathcal {C}}_{d}=[-1,1]^{d}\) of volume \(\text {vol}({\mathcal {C}}_{d})=2^{d}\) and a ball \( {{\mathscr{B}}}_{d}(Z,r) = \{Y \in \mathbb {R}^{d}: \| Y-Z \| \leq r\} \) centred at a point \(Z=(z_{1}, \ldots , z_{d}) \in \mathbb {R}^{d}\); this point Z could be outside \({\mathcal {C}}_{d}\). Denote the fraction of the cube \({\mathcal {C}}_{d}\) covered by the ball \({{\mathscr{B}}}_{d}(Z, r)\) by

$$ C_{d,Z,{ r }}={\text{vol}({\mathcal{C}}_{d} \cap {\mathcal{B}}_{d}(Z,{ r }))}/2^{d} . $$
(3)

Our aim is to approximate Cd,Z,r for arbitrary d, Z and r. To do this, we shall use CLT (Central Limit Theorem). We will derive a CLT-based normal approximation in Section 2.3 and then, using an asymptotic expansion in the CLT for non-identically distributed r.v., we will improve this normal approximation in Section 2.4. In Section A1.8 we consider a more direct approach for approximating Cd,Z,r based on the use of characteristic functions and the fact that Cd,Z,r is a c.d.f. of ∥UZ∥, where U = (u1, … , ud) is random vector with uniform distribution on \({\mathcal {C}}_{d}\). From this, Cd,Z,r can be expressed through the convolution of one-dimensional c.d.f’s. Using this approach we can evaluate the quantity Cd,Z,r with high accuracy but the calculations are rather time-consuming. Moreover, entirely new computations have to be made for different Z and, therefore, we much prefer the approximation of Section 2.4.

Note that in the special case Z = 0, several approximations for the quantity Cd,0,r have been derived in [15] but their methods cannot be generalized to arbitrary Z.

2.2 A Generalization of the Quantity (3)

In the next sections, we will need another quantity which slightly generalizes (3). Assume that we have the cube \({\mathcal {C}}_{d}(\delta )=[-\delta ,\delta ]^{d}\) of volume \(\text {vol}({\mathcal {C}}_{d}(\delta ))=(2\delta )^{d}\), the ball \({{\mathscr{B}}}_{d}(Z^{\prime },{ r^{\prime } })= \{ Y \in \mathbb {R}^{d}: \| Y-Z^{\prime } \| \leq { r^{\prime } }\}\) with a centre at a point \(Z^{\prime }=(z_{1}^{\prime }, \ldots , z_{d}^{\prime }) \). Denote the fraction of the cube \({\mathcal {C}}_{d}(\delta )\) covered by the ball \({{\mathscr{B}}}_{d} (Z^{\prime }, r^{\prime })\) by

$$ {C}_{d,Z^{\prime},{ r^{\prime} }}^{(\delta)}={\text{vol}({\mathcal{C}}_{d}(\delta) \cap {\mathcal{B}}_{d(Z^{\prime}},{ r^{\prime} }))}/(2\delta)^d . $$
(4)

Then, by changing the coordinates and the radius

$$ Z=Z^{\prime}/\delta= ({z}_{1}^{\prime}/\delta, \ldots, z_{d}^{\prime}/\delta) \text{and } r= r^{\prime}/\delta , $$
(5)

we obtain

$$ {C}_{d,Z^{\prime},{ r^{\prime} }}^{(\delta)}=C_{d,Z,{ r }} . $$
(6)

2.3 Normal Approximation for the Quantity (3)

Let U = (u1, … , ud) be a random vector with uniform distribution on \({\mathcal {C}}_{d}\) so that u1, … , ud are i.i.d.r.v. uniformly distributed on [− 1, 1]. Then for given \(Z=(z_{1}, \ldots , z_{d}) \in \mathbb {R}^{d}\) and any r > 0,

$$ C_{d,Z,{ r }} = \mathbb{P} \left\{ \| U - Z \| \leq { r } \right\} = \mathbb{P} \left\{ \| U - Z \|^{2} \leq { r^{2} } \right\} = \mathbb{P} \left\{\sum\limits_{j=1}^{d} (u_j - z_j)^{2} \leq { r }^{2} \right\}. $$

That is, Cd,Z,r, as a function of r, is the c.d.f. of the r.v. ∥UZ∥.

Let u have a uniform distribution on [− 1,1] and |z|≤ 1. In view of Lemma 1 of Appendix 2, the density of the r.v. ηz = (uz)2 is

$$ \varphi_{z(t)} = \left\{ \begin{array}{ll} 1/(2\sqrt{t}) & \text{for} 0<t\leq (1- |z|)^{2}\\ 1/(4\sqrt{t}) & \text{for} (1- |z|)^{2}<t\leq (1+ |z|)^{2} \\ 0 & \text{otherwise} \end{array} \right. $$
(7)

and

$$ \mathbb{E}\eta_{z} =z^{2} +\frac{1}{3}, \text{var}(\eta_{z}) = \frac{4}{3} \left( z^{2} +\frac{1}{15} \right) , {\mu}_{z}^{(3)}= \frac{16}{15} \left( z^{2} +\frac{1}{63} \right) , $$
(8)

where \({\mu }_{z}^{(3)}\) is the third central moment: \({\mu }_{z}^{(3)}=E \left [\eta _{z} - E\eta _{z}\right ]^{3}\).

For |z| > 1, the density of ηz = (uz)2 is

$$ \varphi_{z(t)} = \left\{\begin{array}{ll} 1/(4\sqrt{t}) & \text{for } (1-|z|)^{2}<t\leq (1+|z|)^{2} \\ 0 & \text{otherwise} \end{array} \right. $$
(9)

with expressions (8) for \(\mathbb {E}\eta _{z}\), var(ηz) and \({\mu }_{z}^{(3)}\) not changing.

Consider the r.v.

$$ \| U-Z \|^{2} ={\sum}_{i=1}^d \eta_{z_j}={\sum}_{j=1}^{d} (u_j-z_j)^{2} . $$
(10)

From (8), its mean is

$$ \mu_{d,Z}=\mathbb{E}\| U-Z \|^{2} =\|Z\|^{2} +\frac{d}{3} . $$
(11)

Using independence of u1, … , ud, we also obtain from (8):

$$ {\sigma}_{d,Z}^{2}=\text{var}(\| U-Z \|^{2} ) = \frac{4}{3} \left( \|Z\|^{2} +\frac{d}{15}\right) $$
(12)

and

$$ {\mu}_{d,Z}^{(3)}= \mathbb{E}\left[\| U-Z \|^{2}- \mu_{d,Z}\right]^{3} = {\sum}_{j=1}^{d} \mu_{z_j}^{(3)} = \frac{16}{15} \left( \|Z\|^{2} +\frac{d}{63}\right) . $$
(13)

If d is large enough then the conditions of the CLT for ∥UZ2 are approximately met and the distribution of ∥UZ2 is approximately normal with mean μd,Z and variance \({\sigma }_{d,Z}^{2}\). That is, we can approximate Cd,Z,r by

$$ C_{d,Z,{ r }} \cong {\varPhi} \left( \frac{{ r }^{2}-\mu_{d,Z}}{{\sigma}_{d,Z}} \right) , $$
(14)

where Φ(⋅) is the c.d.f. of the standard normal distribution:

$$ {\varPhi} (t) = {\int}_{-\infty}^{t} \varphi(v)dv \text{with} \varphi(v)=\frac{1}{\sqrt{2\pi}} e^{-v^{2}/2} . $$

The approximation (14) has acceptable accuracy if Cd,Z,r is not very small; for example, it falls inside a 2σ-confidence interval generated by the standard normal distribution (see Figs. 1a and 1b as examples). Let pβ be the quantile of the standard normal distribution defined by Φ(β) = 1 − pβ; for example, \(p_{\beta } \simeq 0.05\) for β = 2. As follows from (11), (12) and the approximation (14), we expect the approximate inequality \(C_{d,Z,{ r }} \gtrapprox p_{\beta }\) if

$$ r \geq R_{d,\|Z\|,\beta}=\left[\|Z\|^{2}+d/3 - 2\beta \sqrt{\|Z\|^{2}/3+d/45 }\right]^{1/2} . $$
(15)
Fig. 1
figure 1

(a) d = 10, Z = 0, r ∈ [1,2.5], (b) d = 50, Z = 0, r ∈ [3.2,4.9]

In many cases discussed in Section 3, the radius r does not satisfy the inequality (15) with β = 2 and even β = 3 and hence the normal approximation (14) is not satisfactorily accurate; this can be evidenced from Figs. 1a, 1b, 2a, 2b, 3a, 3b, 4a, 4b, 5a, 5b, 6a, 6b, 7a, 7b, 8a and 8b.

Fig. 2
figure 2

(a) d = 10, Z = 0, r ∈ [0.95,1.25], (b) d = 50, Z = 0, r ∈ [3.2,3.5]

Fig. 3
figure 3

(a) d = 10, Z is a vertex of \({\mathcal {C}}_{d}\), r ∈ [2, 5], (b) d = 50, Z is a vertex of \({\mathcal {C}}_{d}\), r ∈ [6.5, 9.5]

Fig. 4
figure 4

(a) d = 10, Z is a vertex of \({\mathcal {C}}_{d}\), r ∈ [1.9, 2.5], (b) d = 50, Z is a vertex of \({\mathcal {C}}_{d}\), r ∈ [6.5, 7]

Fig. 5
figure 5

(a) Z is at half-diagonal with \(\|Z\|=\frac {1}{2} \sqrt {10}\), (b) Z is at half-diagonal, \(\|Z\|=\frac {1}{2} \sqrt {50}\)

Fig. 6
figure 6

(a)Z is at half-diagonal, \(\|Z\|=\frac {1}{2} \sqrt {10}\), (b) Z is at half-diagonal, \(\|Z\|=\frac {1}{2} \sqrt {50}\))

Fig. 7
figure 7

(a) d = 10, \(Z\in {\mathcal {S}}_{10}(0, 1.5)\), r ∈ [1, 3.5], (b) d = 50, \(Z\in {\mathcal {S}}_{50}(0, 1.75)\), r ∈ [3.5, 5.5]

Fig. 8
figure 8

(a) d = 10, \(Z\in {\mathcal {S}}_{10}(0, 1.5)\), r ∈ [1, 1.4], (b) d = 50, \(Z\in {\mathcal {S}}_{50}(0, 1.75)\), r ∈ [3.5, 3.75]

In the next section, we improve the approximation (14) by using an Edgeworth-type expansion in the CLT for sums of independent non-identically distributed r.v.

2.4 Improved Normal Approximation

General expansion in the central limit theorem for sums of independent non-identical r.v. has been derived by V. Petrov (see Theorem 7 in Chapter 6 in [10]; see also Proposition 1.5.7 in [12]). The first three terms of this expansion have been specialized by V. Petrov in Section 5.6 in [11]. By using only the first term in this expansion, we obtain the following approximation for the distribution function of ∥UZ2:

$$ P\left( \frac{\| U-Z \|^{2}-\mu_{d,Z}}{\sigma_{d,Z}} \leq x \right) \cong {\varPhi}(x) + \frac{ {\mu}_{d,Z}^{(3)}}{6 \left( {\sigma}_{d,Z}^{2}\right)^{3/2} }(1-x^{2})\varphi(x), $$

leading to the following improved form of (14):

$$ C_{d,Z,{ r }} \cong {\varPhi}(t) + \frac{ \|Z\|^{2}+d/63}{5\sqrt{3} (\|Z\|^{2}+d/15)^{3/2} }(1-t^{2})\varphi(t) , $$
(16)

where

$$ t = t_{d,\|Z\|,{ r }}= \frac{{ r }^{2}-\mu_{d,Z}}{{\sigma}_{d,Z}} = \frac{\sqrt{3}(r^{2}- \|Z\|^{2} -d/3)}{2\sqrt{ \|Z\|^{2} +{d}/{15}} } . $$
(17)

From the viewpoint of Section 3, the range of most important values of t from (17) is − 3 ± 1. For such values of t, the uncorrected normal approximation (14) significantly overestimates the values of Cd,Z,r (see Figs. 1a, 1b, 2a, 2b, 3a, 3b, 4a, 4b, 5a, 5b, 6a, 6b, 7a, 7b, 8a and 8b below). The approximation (16) brings the normal approximation down and makes it much more accurate. The other terms in Petrov’s expansion of [10] and [11] continue to bring the approximation down (in a much slower fashion) so that the approximation (16) still slightly overestimates the true value of Cd,Z,r (at least, in the range of interesting values of t from (17)). However, if d is large enough (say, d ≥ 20) then the approximation (16) is very accurate and no further correction is needed.

A very attractive feature of the approximations (14) and (17) is their dependence on Z through ∥Z∥ only. We could have specialized for our case the next terms in Petrov’s approximation but these terms no longer depend on ∥Z∥ only (this fact can be verified from the formula (46) for the fourth moment of the r.v. νz = (zu)2) and hence the next terms are much more complicated. Moreover, adding one or two extra terms from Petrov’s expansion to the approximation (16) does not fix the problem entirely for all Z and r. Instead, we propose a slight adjustment to the r.h.s of (16) to improve this approximation, especially for small dimensions. Specifically, we suggest the approximation

$$ C_{d,Z,{ r }} \cong {\varPhi}(t) + c_d\frac{\|Z\|^{2}+d/63}{5\sqrt{3} (\|Z\|^{2}+d/15)^{3/2} }(1-t^{2})\varphi(t) , $$
(18)

where cd = 1 + 3/d if the point Z lies on the diagonal of the cube \({\mathcal {C}}_{d}\) and cd = 1 + 4/d for a typical (random) point Z. For typical (random) points \(Z \in {\mathcal {C}}_{d}\), the values of Cd,Z,r are marginally smaller than for the points on the diagonal of \({\mathcal {C}}_{d}\) having the same norm, but the difference is very small. In addition to the points on the diagonal, there are other special points: the points whose components are all zero except for one. For such points, the values of Cd,Z,r are smaller than for typical points Z with the same norm, especially for small r. Such points, however, are of no value for us as they are not typical and we have never observed in simulations random points that come close to these truly exceptional points.

2.5 Simulation Study

In Figs.1a, 1b, 2a, 2b, 3a, 3b, 4a, 4b, 5a, 5b, 6a, 6b, 7a, 7b, 8a and 8b we demonstrate the accuracy of approximations (14), (16) and (18) for Cd,Z,r in dimensions d = 10,50 for the following locations of Z:

  • (i) Z = 0, the centre of the cube \({\mathcal {C}}_{d}\);

  • (ii) \(\|Z\|=\sqrt {d}\), with Z being a vertex of the cube \({\mathcal {C}}_{d}\);

  • (iii) Z lies on a diagonal of \({\mathcal {C}}_{d}\) with |zj| = λ ≥ 0 for all j = 1, … , d and \(\|Z\|=\lambda \sqrt {d}\);

  • (iv) Z is a random vector uniformly distributed on the sphere \({\mathcal {S}}_{d}(0,v)\) with some v > 0.

There are figures of two types. In the figures of the first type, we plot Cd,Z,r over a wide range of r ensuring that values of Cd,Z,r lie in the whole range [0, 1]. In the figures of the second type, we plot Cd,Z,r over a much smaller range of r with Cd,Z,r lying in the range [0,ε] for some small positive ε such as ε = 0.015. For the purpose of using the approximations of Section 3, we need to assess the accuracy of all approximations for smaller values of Cd,Z,r and hence the second type of plots are often more insightful. In Figs. 1a, 1b, 2a, 2b, 3a, 3b, 4a, 4b, 5a, 5b, 6a, 6b, 7a and 7b the solid black line depicts values of Cd,Z,r computed via Monte Carlo methods, the blue dashed, the red dot-dashed and green long dashed lines display approximations (14), (16) and (18), respectively.

In the case where Z is a random vector uniformly distributed on a sphere \({\mathcal {S}}_{d}(0,v)\), the style of the figures of the second type is slightly changed to adapt for this choice of Z and provide more information for Z which do or do not belong to the cube \({\mathcal {C}}_{d}\). In Figs. 8a and 8b, the thick dashed red lines correspond to random points \(Z \in {\mathcal {S}}_{d}(0,v) \cap {\mathcal {C}}_{d}\). The thick dot-dashed orange lines correspond to random points \(Z \in {\mathcal {S}}_{d}(0,v)\) such that \(Z\not \in {\mathcal {C}}_{d}\). Approximations (14) and (16) are depicted in the same manner as previous figures but the approximation (18) is now represented by a solid green line. The thick solid red line displays values of Cd,Z,r for Z on the diagonal of \({\mathcal {C}}_{d}\) with ∥Z∥ = v with v = 1.5 for d = 10 and v = 1.75 for d = 50.

From the simulations that led to Figs. 1a, 1b, 2a, 2b, 3a, 3b, 4a, 4b, 5a, 5b, 6a, 6b, 7a, 7b, 8a and 8b we can make the following conclusions.

  • The normal approximation (14) is quite satisfactory unless the value Cd,Z,r is small.

  • The accuracy of all approximations improves as d grows.

  • The approximation (18) is very accurate even if the values Cd,Z,r are very small.

  • If d is large enough then the approximations (16) and (18) are practically identical and are extremely accurate.

3 Covering a Cube by n Balls

In this section, we consider the main problem of covering the cube \({\mathcal {C}}_{d}=[-1,1]^{d}\) by the union of n balls \({{\mathscr{B}}}_{d}(Z_{j},r)\) as formulated in Section 1.2. We will discuss different schemes of choosing the set of ball centres \(\mathbb {Z}_{n}=\{Z_{1}, \ldots , Z_{n}\}\) for given d and n. The radius r will then be chosen to achieve the required probability of covering: \(C_{d}(\mathbb {Z}_{n},r)\geq 1-\gamma \). Most of the schemes will involve one or several parameters which we will want to choose in an optimal way.

3.1 The Main Covering Scheme

The following will be our main scheme for choosing \(\mathbb {Z}_{n}=\{Z_{1}, \ldots , Z_{n}\}\).

  • Scheme 1. Z1, … , Zn are i.i.d. random vectors uniformly distributed in the cube \({\mathcal {C}}_{d}(\delta )=[-\delta ,\delta ]^{d}\), where δ ∈ [0, 1] is a parameter

We will formulate several other covering schemes and compare them with Scheme 1. The reasons why we have chosen Scheme 1 as the main scheme are the following.

  • It is easier to theoretically investigate than all other non-trivial schemes.

  • It includes, as a special case when δ = 1, the scheme which is very popular in practice of Monte Carlo [8] and global random search [18, 19] and is believed to be rather efficient (this is not true).

  • Numerical studies provided below show that Scheme 1 with optimal δ provides coverings which are rather efficient, especially for large d (see Section 3.5 for a discussion regarding this issue).

3.2 Theoretical Investigation of Scheme 1

Let Z1, … , Zn be i.i.d. random vectors uniformly distributed in the cube \({\mathcal {C}}_{d}(\delta )\) with 0 < δ ≤ 1. Then, for given \(U = (u_{1}, \ldots , u_{d}) \in \mathbb {R}^{d}\),

$$ \begin{array}{@{}rcl@{}} \mathbb{P} \left\{ U \in {\mathcal{B}}_{d}(\mathbb{Z}_{n},r) \right\}&=& 1-\prod\limits_{j=1}^{n} \mathbb{P} \left\{ U \notin {\mathcal{B}}_{d}({Z}_j,r) \right\} \\ &=& 1-\prod\limits_{j=1}^{n}\left( 1-\mathbb{P} \left\{ U \in {\mathcal{B}}_{d}({Z}_j,r) \right\} \right) \\ &=& 1-\left( 1-\mathbb{P}_{_Z} \left\{ \|U - {Z} \| \leq r \right\} \right)^{n} \end{array} $$
(19)

where \({{\mathscr{B}}}_{d}(\mathbb {Z}_{n},r)\) is defined in (2). The main characteristic of interest \(C_{d}(\mathbb {Z}_{n},r)\), defined in (1), the proportion of the cube covered by the union of balls \({{\mathscr{B}}}_{d}(\mathbb {Z}_{n},r)\), is simply

$$ C_{d}(\mathbb{Z}_{n},r) = \mathbb{E}_{_U} \mathbb{P} \left\{ U \in {\mathcal{B}}_{d}(\mathbb{Z}_{n},r) \right\} $$
(20)

Continuing (19), note that

$$ \mathbb{P}_{_Z} \left\{ \|U - {Z} \| \leq r \right\} = \mathbb{P}_{_Z} \left\{ \sum\limits_{j=1}^{d} (z_j-u_j)^{2} \leq r^{2} \right\} = {C}_{d,U,{ r }}^{(\delta)} , $$
(21)

where \({C}_{d,U,{ r }}^{(\delta )}\) is defined by the formula (4). From (5) and 6 we have \( \label {eq:inters2ae} {C}_{d,U,{ r }}^{(\delta )}=C_{d,U/\delta ,{ r/\delta }} \) where Cd,U/δ,r/δ is the quantity defined by (3). This quantity can be approximated in a number of different ways as shown in Section 2. We will compare (14), the simplest of the approximations, with the approximation given in (18). Approximation (14) gives

$$ {C}_{d,U,{ r }}^{(\delta)}=C_{d, U/\delta,{ r/ \delta }} \cong {\varPhi} \left( \frac{{ (r/ \delta) }^{2}- \| U \|^{2}/\delta^{2} -d/3 }{2 \sqrt{ \|U\|^{2}/(3\delta^{2}) + d/45} } \right) , $$
(22)

whereas approximation (18) provides

$$ {C}_{d,U,{ r }}^{(\delta)} \cong {\varPhi}(t_{\delta}) + c_d\frac{ \|U\|^{2}/\delta^{2}+d/63}{5\sqrt{3} (\|U\|^{2}/\delta^{2}+d/15)^{3/2} }(1-{t}_{\delta}^{2})\varphi(t_{\delta}) , $$
(23)

with cd = 1 + 4/d and

$$ t_{\delta} = \frac{(r/\delta)^{2}- \|U\|^{2}/\delta^{2} -d/3}{2\sqrt{\|U\|^{2}/(3\delta^{2}) +d/45 } } . $$

From (37), \(\mathbb {E} \| U \|^{2}= d/3\) and var(∥U2) = 4d/45. Moreover, if d is large enough then \(\| U \|^{2}= {\sum }_{j=1}^{d} {u_{j}^{2}}\) is approximately normal.

We shall simplify the expression (19) by using the approximation

$$ (1-t)^{n} \simeq e^{-nt} , $$
(24)

which is a good approximation for small values of t and moderate values of nt; this agrees with the ranges of d, n and r we are interested in.

We can combine the expressions (20) and (19) with approximations (22), (23) and (24) as well as with the normal approximation for the distribution of ∥U2, to arrive at two final approximations for \(C_{d}(\mathbb {Z}_{n},r)\) that differ in complexity. If the original normal approximation of (22) is used then we obtain

$$ C_{d}(\mathbb{Z}_{n},r) \simeq 1- {\int}_{-\infty}^{\infty} \psi_1(s) \varphi(s)d s, $$
(25)

with

$$ \psi_1(s)=\exp \left\{-n {\varPhi}(c_{s}) \right\} , \ c_{s}= \frac{{3 (r/\delta) }^{2}- s^{\prime} -d }{2 \sqrt{ s^{\prime} + d/5} } , s^{\prime}=(d+2s \sqrt{d/5})/\delta^{2} . $$

If approximation (23) is used, we obtain:

$$ C_d(\mathbb{Z}_n,r) \simeq 1- {\int}_{-\infty}^{\infty} \psi_2(s) \varphi(s)d s, $$
(26)

with

$$ \psi_{2}(s)=\exp \left\{-n \left( {\varPhi}(c_{s}) +\left( 1+\frac{4}{d}\right) \frac{s^{\prime}+d/21}{5[ s^{\prime} +d/5 ]^{3/2}}(1-{c}_s^{2})\varphi(c_{s}) \right) \right\} . $$

3.3 Simulation Study for Assessing Accuracy of Approximations (25) and (26)

In Figs. 9a, 9b, 10a, 10b, 11a and 11b, \(C_{d}(\mathbb {Z}_{n},r) \) is represented by a solid black line and has been obtained via Monte Carlo methods. Approximation (25) is indicated by a dashed blue line and approximation (26) is represented by long dashed green lines. All figures demonstrate that approximation (26) is extremely accurate across different dimensions and values of n. This approximation is much superior to approximation (25).

Fig. 9
figure 9

(a) \(C_{d}(\mathbb {Z}_{n},r)\) and approximations: n = 128, (b) \(C_{d}(\mathbb {Z}_{n},r)\) and approximations: n = 512

Fig. 10
figure 10

(a) \(C_{d}(\mathbb {Z}_{n},r) \) and approximations: n128, (b) \(C_{d}(\mathbb {Z}_{n},r) \) andapproximations: n = 512

Fig. 11
figure 11

(a) \(C_{d}(\mathbb {Z}_{n},r) \) and approximations: n = 512, (b) \(C_{d}(\mathbb {Z}_{n},r) \) and approximations: n = 512

3.4 Other Schemes

In addition to Scheme 1, we have also considered the following schemes for choosing \(\mathbb {Z}_{n}=\{Z_{1}, \ldots , Z_{n}\}\).

  • Scheme 2. Z1 = 0; Z2, … , Zn are i.i.d. random vectors uniformly distributed in the cube \({\mathcal {C}}_{d}(\delta )=[-\delta ,\delta ]^{d}\)

  • Scheme 3. Z1, … , Zn are taken from the minimum-aberration fractional factorial design on vertices of the cube \({\mathcal {C}}_{d}(\delta )=[-\delta ,\delta ]^{d}\)

  • Scheme 4. Z1, … , Zn are i.i.d. random vectors on \({\mathcal {C}}_{d}(\delta )\) with independent components distributed according to beta-distribution with density (34) with some α > 0

  • Scheme 5. Z1, … , Zn are i.i.d. random vectors uniformly distributed in the ball \({{\mathscr{B}}}_{d}(\delta )\)

  • Scheme 6. Z1, … , Zn are i.i.d. random vectors uniformly distributed on the sphere \({\mathcal {S}}_{d}(\delta )\)

  • Scheme 7. Z1, … , Zn are taken from a low-discrepancy Sobol’s sequence on the cube \({\mathcal {C}}_{d}(\delta )\)

The rationale behind the choice of these schemes is as follows. By studying Scheme 2, we test the importance of inclusion of 0 into \(\mathbb {Z}_{n}\). We propositioned that if we included 0 into \(\mathbb {Z}_{n}\), the optimal value of δ may increase for some of the schemes making them more efficient; this effect has not been detected.

Scheme 3 with optimal δ is an obvious candidate for being the most efficient. Unlike all other schemes considered, Scheme 3 is only defined for the values of n of the form n = 2k with kd.

By using Scheme 4, we test the possibility of improving Scheme 1 by changing the distribution of points in the cube \({\mathcal {C}}_{d}(\delta )\). We have found that the effect of distribution is strong and smaller values of α lead to more efficient covering schemes. By choosing α small enough, like α = 0.1, we achieve the average efficiency of the covering schemes which is rather close to the efficiency of Scheme 3. Tables 12 and 3 contain results obtained for Scheme 4 with α = 0.5 and α = 1.5; if α = 1 then Scheme 4 becomes Scheme 1.

From Section A1.4, we know that for constructing efficient designs we have to somehow restrict the norms of Zj’s. In Schemes 5 and 6, we are trying to do this in an alternative way to Schemes 1 and 4.

Scheme 7 is a natural improvement of Scheme 1. As a particular case with δ = 1, it contains one of the best known low-discrepancy sequences and hence Scheme 7 with δ = 1 serves as the main benchmark with which we compare other schemes. For construction, we have used the R-implementation of the Sobol’s sequences; it is based on [4].

For all the schemes excluding Scheme 3, the sequences \(\mathbb {Z}_{n}=\{Z_{1}, \ldots , Z_{n}\}\) are nested so that \(\mathbb {Z}_{n} \subset \mathbb {Z}_{m} \) for all n < m; using the terminology of [6], these schemes provide on-line coverings of the cube. Note that for the chosen values of n, Scheme 7 also has some advantage over other schemes considered. Indeed, despite Sobol’s sequences are nested, the values n of the form n = 2k are special for the Sobol’s sequences and for such values of n the Sobol’s sequences possess extra uniformity properties that they do not possess for other values of n.

3.5 Numerical Comparison of Schemes

In Tables 12 and 3, for Schemes 1, 2, 4, 5 and 6, we present the smallest values of r required to achieve an 0.9-coverage on average. For these schemes, the value inside the parentheses shows the average value of δ required to obtain 0.9-coverage. For Schemes 3 and 7, we give the smallest value of r needed for a 0.9-coverage. For these two schemes, the value within the bracket corresponds to the (non random) value of δ with which we attain such a coverage.

In Figs. 12a, 12b, 13a, 13b, 14a, 14b, 15a and 15b we plot \(C_{d}(\mathbb {Z}_{n},r) \) as a functions of δ ∈ [0, 1] across a number schemes, n and d. For these plots we have used the values of r provided in Tables 12 and 3 such that for Figs. 12a, 12b, 13a and 13b which correspond to Scheme 1 and Scheme 2, the maximum coverage is very close to 0.9 and the optimal δ is very close to the values presented in Tables 12 and3. For Figs. 12a, 12b, 13a, 13b, 14a, 14b, 15a and 15b the maximum coverage 0.9 is attained with δ provided in Tables 12 and 3. In Figs. 12a, 12b, 13a, 13b, 14a, 14b, 15a and 15b the solid green line, long dashed red line, dashed blue line and dot dashed orange line correspond to n = 64,128,512,and 1024 respectively. The vertical lines on these plots indicate the value of δ where the maximum coverage is obtained.

Fig. 12
figure 12

(a) Scheme 1: \(C_{d}(\mathbb {Z}_{n},r)\) across δ for d = 10, (b) Scheme 1: \(C_{d}(\mathbb {Z}_{n},r)\) across δ for d = 50

Fig. 13
figure 13

(a) Scheme 2: \(\protect {C}_{d}(\mathbb {Z}_{n},r)\) across δ for d = 10, (b) Scheme 2: \(\protect {C}_{d}(\mathbb {Z}_{n},r)\) across δ for d = 50

Fig. 14
figure 14

(a) Scheme 3: \(C_{d}(\mathbb {Z}_{n},r)\) across δ for d = 10, (b) Scheme 3: \(C_{d}(\mathbb {Z}_{n},r)\) across δ for d = 50

Fig. 15
figure 15

(a) Scheme 7: \(C_{d}(\mathbb {Z}_{n},r)\) across δ for d = 10, (b) Scheme 7: \(C_{d}(\mathbb {Z}_{n},r)\) across δ for d = 50

From Tables 12 and 3 and Figs.12a, 12b, 13a, 13b, 14a, 14b, 15a and 15b we arrive at the following conclusions:

  • the δ-effect is very important and getting much stronger as d increases;

  • coverage of unadjusted low-discrepancy sequences is extremely low;

  • properly δ-tuned deterministic Scheme 3, which uses fractional factorial designs of minimum abberation, provides excellent covering;

  • randomized Scheme 4 with suitably chosen parameters of the beta-distribution, also provides very high quality coverage (on average);

  • for all schemes considered, the coverings with the optimal values of δ fully comply with the result of Section A1.4 describing the area of volume concentration in the cube \({\mathcal {C}}_{d}\).

4 Covering a Cube by Cubes

4.1 Volume of Intersection of Two Cubes

Let us take two cubes: \({\mathcal {C}}_{d}=[-1,1]^{d}\) and \({\mathcal {C}}_{d}(Z,r) = \{ Y \in \mathbb {R}^{d}: \| Y - Z\|_{\infty } \leq { r } \}\), a cube of side length 2r centred at a point \(Z =(z_{1}, \ldots , z_{d}) \in {\mathcal {C}}_{d}\). Denote the fraction of the cube \({\mathcal {C}}_{d}\) covered by \({\mathcal {C}}_{d}(Z,r)\) by

$$ \begin{array}{@{}rcl@{}} F_{d,Z,r}={\text{vol}({\mathcal{C}}_{d} \cap {\mathcal{C}}_d(Z,{ r }))}/2^{d} . \end{array} $$
(27)

Let, like in Section 2.3, U = (u1, … , ud) be a random vector with uniform distribution on \({\mathcal {C}}_{d}\) so that u1, … , ud are i.i.d.r.v. uniformly distributed on [− 1,1]. Then

$$ F_{d,Z,{ r }}= \mathbb{P} \left\{ \| U-Z \|_{\infty} \leq { r } \right\}= \mathbb{P} \left\{\max_{1\leq j \leq d} |u_j-z_j| \leq r \right\} . $$

That is, Fd,Z,r, as a function of r, is the c.d.f. of the r.v. \(\| U-Z \|_{\infty } = \max \limits _{1\leq j \leq d} |u_{j}-z_{j}|\).

From Lemma 2 of Appendix 2 the c.d.f. of the r.v. |ujzj| is

$$ \begin{array}{@{}rcl@{}} G_{d,z_j}(t) = \mathbb{P}\{|u_j - z_j| \leq t \}= \left\{\begin{array}{cl} 0 & \text{for} t\leq 0\\ t & \text{for} 0<t< 1- |z_j|\\ \frac{1}{2}({1 + t - |z_j|}) & \text{for} 1- |z_j| \leq t\leq 1+ |z_j| \\ 1 & 1+ |z_j| < t . \end{array} \right. \end{array} $$

Since the c.d.f. of a maximum of independent r.v. is the product of marginal c.d.f.’s, we obtain

$$ F_{d,Z,{ r }}= {\prod}_{j=1}^{d} G_{d,z_{j}}(r) . $$

Two extreme particular cases of location of Z are:

  • (i) Z = 0: Fd,0,r = rd, 0 ≤ r ≤ 1;

  • (ii) \(\|Z\|=\sqrt {d}\), when Z being a vertex of the cube \({\mathcal {C}}_{d}\): Fd,V,r = (r/2)d, 0 ≤ r ≤ 2.

Assume now that we have the cube \({\mathcal {C}}_{d}(\delta )=[-\delta ,\delta ]^{d}\) of volume (2δ)d and another cube \( {\mathcal {C}}_{d}(Z^{\prime },{ r^{\prime } })= \{ Y \in \mathbb {R}^{d}: \| Y-Z^{\prime } \|_{\infty } \leq r^{\prime }\} \) with a centre at a point \(Z^{\prime }=(z_{1}^{\prime }, \ldots , z_{d}^{\prime }) \). Denote the fraction of the cube \({\mathcal {C}}_{d}(\delta )\) covered by \({\mathcal {C}}_{d}(Z^{\prime }, r^{\prime })\) by

$$ {F}_{d,Z^{\prime},{ r^{\prime} }}^{(\delta)}={\text{vol}({\mathcal{C}}_{d}(\delta) \cap {\mathcal{C}}_d(Z^{\prime},{ r^{\prime} }))}/(2\delta)^d . $$

Then by changing the coordinates and the radius using (5) we get \({F}_{d,Z^{\prime },{ r^{\prime } }}^{(\delta )}=F_{d,Z/\delta ,{ r/\delta }} . \)

4.2 Proportion of a Cube Covered by Smaller Cubes with Random Centres

Let us take the cube \({\mathcal {C}}_{d}=[-1,1]^{d}\) and n smaller cubes \({\mathcal {C}}_{d}(Z_{j},{ r })= \{ Y \in \mathbb {R}^{d}: \| Y-Z_{j} \|_{\infty } \leq { r } \}\) with centres at points \(Z_{j} \in \mathbb {R}^{d}\). Denote the fraction of the cube \({\mathcal {C}}_{d}\) covered by \({\mathcal {C}}_{d}(\mathbb {Z}_{n},{ r })= \cup _{j=1}^{n} {\mathcal {C}}_{d}(Z_{j}, {r })\), the union of these cubes, by

$$ C_{d,\mathbb{Z}_{n},{ r }}={\text{vol}({\mathcal{C}}_{d} \cap {\mathcal{C}}_d(\mathbb{Z}_{n},{ r }))}/2^{d} . $$

Our aim is to obtain a closed form expression for this quantity for arbitrary d,r and n in the case when Z1, … , Zn are i.i.d. random vectors uniformly distributed in the cube \({\mathcal {C}}_{d}(\delta )=[-\delta ,\delta ]^{d}\) with 0 < δ ≤ 1.

Similarly to the combination of (19) with (21), for a given \(U = (u_{1}, \ldots , u_{d}) \in \mathbb {R}^{d}\),

$$ \mathbb{P} \left\{ U \in {\mathcal{C}}_d(\mathbb{Z}_{n},r) \right\}= 1-\left( 1-F_{d, U/\delta,{ r/\delta }} \right)^{n} . $$

Similarly to (20),

$$ \begin{array}{@{}rcl@{}} C_{d,\mathbb{Z}_{n},{ r }} = \mathbb{E}_{_U} \mathbb{P} \left\{ U \in {\mathcal{C}}_d(\mathbb{Z}_{n},r) \right\} = 1- \mathbb{E}_{_U} \left( 1-F_{d, U/\delta,{ r/\delta }} \right)^{n}. \end{array} $$

For an integer k, set

$$ I_k = \frac{1}{2} {\int}_{-1}^{1} \left[G_{d,u/\delta}(r/\delta) \right]^{k}du . $$
(28)

Then, using the binomial theorem, we have

$$ C_{d,\mathbb{Z}_{n},{ r }} = 1 - {\sum}_{k=0}^{n}(-1)^{k} \binom{n}{k} I_k^d . $$
(29)

It is possible to evaluate (28) explicitly. For k = 0 and for rδ + 1, we clearly have Ik = 1. For k ≥ 1 and 0 ≤ rδ + 1, the integral Ik takes different forms depending on the values of r and δ:

$$ I_k= \left\{ \begin{array}{cl} (\delta-r) \left( \frac{r}{\delta}\right)^{k} - \frac{2\delta}{(k+1)} \left\{\left( \frac{\delta+r-1}{2\delta} \right)^{k+1} - \left( \frac{r}{\delta} \right)^{k+1} \right \} & \text{for}\ r\leq \delta \\ &\\ (r-\delta) - \frac{2\delta}{(k+1)} \left\{ \left( \frac{\delta+r-1}{2\delta} \right)^{k+1} - 1 \right \} & \text{for}\ 0 \leq r - \delta \leq 1, r+\delta \geq1 \\ &\\ (r-\delta) + {2\delta }{/(k+1)} & \text{for}\ 0 \leq r - \delta \leq 1, r+\delta \leq 1. \end{array} \right. $$

In Figs. 16a and 16b, we depict values of \(C_{d,\mathbb {Z}_{n},r}\) (computed using (29)) as a function of δ for a number of choices of r. As in Section 3.5, we note that the δ-effect holds for the problem of coverage of the cube by smaller cubes.

Fig. 16
figure 16

(a) n = 50, r ∈ [0.7,0.85] increasing by 0.05, (b) n = 128, r ∈ [0.6,0.8] increasing by 0.05

5 Quantization

In this section, we briefly consider the following problem of quantization also known as the problem of facility location. Let X = (x1, … , xd) be uniform on \({\mathcal {C}}_{d}=[-1,1]^{d}\) and \(\mathbb {Z}_{n}=\{Z_{1}, \ldots , Z_{n}\}\) be an n-point design. The mean square quantization error is \(\theta _{n}=\theta (\mathbb {Z}_{n})=\mathbb {E}_{X}\underset {i=1, \ldots , n}{\min \limits } \|X-Z_{i}\|^{2}\). In the case where Z1, … , Zn are i.i.d. uniform on \({\mathcal {C}}_{d}(\delta )\), we will derive a simple approximation for the expected value of \(\theta (\mathbb {Z}_{n})\) in order to demonstrate the δ-effect. We shall also notice a strong correlation in design efficiency used for quantization and for (1 − γ)-covering as studied in Section 3.

The two characteristics, \(C_{d}(\mathbb {Z}_{n},r)\) and \( \theta (\mathbb {Z}_{n})\), are related as follows. \(\mathbb {E}_{\mathbb {Z}_{n}} C_{d}(\mathbb {Z}_{n},r)\), as a function of r ≥ 0, is the c.d.f. of the r.v. \(\mathbb {E}_{\mathbb {Z}_{n}}\min \limits _{i=1, \ldots , n} \|X-Z_{i}\|\), where X is uniform on [− 1,1]d, while \(\mathbb {E}_{\mathbb {Z}_{n}}\theta (\mathbb {Z}_{n})\) is the second moment of the distribution with this c.d.f.:

$$ \mathbb{E}_{\mathbb{Z}_{n}} \theta(\mathbb{Z}_{n})= \mathbb{E}_{\mathbb{Z}_{n}} \mathbb{E}_{X}\underset{i=1, \ldots, n}{\min} \|X-Z_{i}\|^{2} = \mathbb{E}_{X} \mathbb{E}_{\mathbb{Z}_{n}} \underset{i=1, \ldots, n}{\min} \|X-Z_{i}\|^{2} = {\int}_{r\geq 0} r^{2} d \mathbb{E}_{\mathbb{Z}_{n}}C_{d}(\mathbb{Z}_{n},r) . $$
(30)

Using approximation (26) we obtain

$$ \begin{array}{@{}rcl@{}} \frac{d}{dr}(\mathbb{E}_{\mathbb{Z}_{n} } C_{d}(\mathbb{Z}_{n},r)) \cong f_{\delta}(r) &:=& \frac{n \cdot r}{\delta} {\int}_{-\infty}^{\infty} \frac{\varphi(s)\varphi(c_{s}) \psi_{2}(s)}{\sqrt{s^{\prime}+k}} \times \\ &\times& \left [\sqrt{3}+\left( 1+\frac{4}{d} \right) \frac{ \left( s^{\prime}+\frac{d\delta^{2}}{63} \right)}{ 5\left (s^{\prime}+k\right)^{3/2} } \left\{{\delta(c_{s}^{3}-c_{s})} - \frac{\sqrt{3}(r^{2}-\frac{d\delta^{2}}{3}-s^{\prime})}{ \sqrt{s^{\prime}+k}} \right\} \right] ds \end{array} $$

with

$$ \begin{array}{@{}rcl@{}} \psi_{2}(s)=\exp \left\{-n \left( {\varPhi}(c_{s}) + \left( 1+\frac{4}{d} \right)\frac{ \delta \left[ s^{\prime}+\frac{d\delta^{2}}{63} \right]}{ 5\sqrt{3}\left [s^{\prime}+k\right]^{3/2} }(1-c_{s}^{2})\phi(c_{s}) \right) \right\} \end{array} $$

and

$$ c_{s}= \frac{\sqrt{3}\left( r^{2} -s^{\prime} - \frac{d\delta^{2}}{3} \right)}{2\delta\sqrt{ s^{\prime}+k }}, s^{\prime}=s\sqrt{\frac{4d}{45}}+d/3 , k = \frac{d\delta^{2}}{15} . $$

Therefore using relation (30) the approximation for \(\mathbb {E}_{\mathbb {Z}_{n}} \theta (\mathbb {Z}_{n})\) for Scheme 1 is:

$$ \mathbb{E} \theta_n=\mathbb{E}_{\mathbb{Z}_{n}} \theta(\mathbb{Z}_{n}) \cong {\int}_{r\geq 0} r^{2} f_{\delta}(r) dr . $$
(31)

In Figs. 17a, 17b, 18a and 18b, we asses the accuracy of the approximation (31). In these figures, the solid black line corresponds to \(\mathbb {E} \theta _{n} \) obtained via Monte Carlo methods and the dashed red line depicts the approximation. We see that the accuracy of approximation (31) is very high for small n and large d. However, as Fig. 17b shows, if d is not large enough but n is large then the errors accumulate and the resulting approximation may not be accurate enough.

Fig. 17
figure 17

(a) \(\mathbb {E} \theta _{n}\) and approximation (31); n = 128, (b) \(\mathbb {E} \theta _{n}\) and approximation (31); n = 512

Fig. 18
figure 18

(a) \(\mathbb {E} \theta _{n}\) and approximation (31); n = 128, (b) \(\mathbb {E} \theta _{n}\) and approximation (31); n = 512

As follows from results of [8, Ch.6], for efficient covering schemes the order of convergence of the covering radius to 0 as \(n \to \infty \) is n− 1/d. Therefore, for the mean squared distance (which is the quantization error) we should expect the order n− 2/d as \(n \to \infty \). Therefore, for sake of comparison of quantization errors θn across n we renormalize this error from \(\mathbb {E} \theta _{n}\) to \(n^{2/d} \mathbb {E} \theta _{n}\).

In Tables 45 and 6, we present the minimum value of \(n^{2/d} \mathbb {E} \theta _{n}\) for a selection of the schemes among those considered in Section 3. In these tables, the value within the parentheses corresponds to the value of δ where the minimum of \(n^{2/d} \mathbb {E} \theta _{n}\) was obtained. For Scheme 3, typical behaviour of \(\mathbb {E} \theta _{n}\) across δ for a number and n and d is presented in Figs. 19a, 19b, 20a and 20b.

Fig. 19
figure 19

(a) \(\mathbb {E} \theta _{n}\) with n = 128, (a) \(\mathbb {E} \theta _{n}\) with n = 512

Fig. 20
figure 20

(a) \(\mathbb {E} \theta _{n}\) with n = 128, (b) \(\mathbb {E} \theta _{n}\) with n = 512

Table 4 Minimum value of \(n^{2/d} \mathbb {E} \theta _{n}\) and δ (in parentheses) across schemes and n for d = 10
Table 5 Minimum value of \(n^{2/d} \mathbb {E} \theta _{n}\) and δ (in parentheses) across schemes and n for d = 20
Table 6 Minimum value of \(n^{2/d} \mathbb {E} \theta _{n}\) and δ (in parentheses) across schemes and n for d = 50.

We make the following two main conclusions from analysing results of this numerical study:

  • (a) the presence of a strong δ-effect, very similar to the effect observed in Section 3, and

  • (b) for a given design \(\mathbb {Z}_{n}\), there is a very strong correlation between the covering probability as studied in Section 3 and the normalized quantization error \(n^{2/d}\mathbb {E} \theta (\mathbb {Z}_{n})\).

By comparing the values of δ in Tables 45 and 6 with those in Tables 12 and 3, we see a strong similarity between efficient quantization schemes and efficient covering schemes.