Covering of High-Dimensional Cubes and Quantization

Zhigljavsky, Anatoly; Noonan, Jack

doi:10.1007/s43069-020-0015-8

Covering of High-Dimensional Cubes and Quantization

Original Research
Open access
Published: 13 August 2020

Volume 1, article number 18, (2020)
Cite this article

Download PDF

You have full access to this open access article

SN Operations Research Forum Aims and scope Submit manuscript

Covering of High-Dimensional Cubes and Quantization

Download PDF

Anatoly Zhigljavsky¹ &
Jack Noonan¹

2303 Accesses
8 Citations
Explore all metrics

Abstract

As the main problem, we consider covering of a d-dimensional cube by n balls with reasonably large d (10 or more) and reasonably small n, like n = 100 or n = 1000. We do not require the full coverage but only 90% or 95% coverage. We establish that efficient covering schemes have several important properties which are not seen in small dimensions and in asymptotical considerations, for very large n. One of these properties can be termed ‘do not try to cover the vertices’ as the vertices of the cube and their close neighbourhoods are very hard to cover and for large d there are far too many of them. We clearly demonstrate that, contrary to a common belief, placing balls at points which form a low-discrepancy sequence in the cube, results in a very inefficient covering scheme. For a family of random coverings, we are able to provide very accurate approximations to the coverage probability. We then extend our results to the problems of coverage of a cube by smaller cubes and quantization, the latter being also referred to as facility location. Along with theoretical considerations and derivation of approximations, we provide results of a large-scale numerical investigation.

Efficient Quantisation and Weak Covering of High Dimensional Cubes

Article Open access 03 June 2022

Non-lattice Covering and Quantization of High Dimensional Sets

Capacitated Covering Problems in Geometric Spaces

Article 29 August 2019

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

In this paper, we develop and study efficient schemes for covering and quantization in high-dimensional cubes. In particular, we will demonstrate that the proposed schemes are much superior to the so-called low-discrepancy sequences. The paper starts with introducing the main notation, then we formulate the main problem of covering a d-dimensional cube by n Euclidean balls. This is followed by a discussion on the main principles we have adopted for construction of our algorithms. Then we briefly formulate problems of covering a cube by smaller cubes (which are balls in the $L_{\infty }$-norm) and the problem of quantization. Both problems have many similarities with the main problem of covering a cube by n balls. At the end of this section, we describe the structure of the remaining sections of the paper and summarize our main findings.

1.1 Main Notation

$\mathbb {R}^{d}$: d-dimensional space; vectors in $\mathbb {R}^{d}$ are row-vectors;
∥⋅∥ and $\|\cdot \|_{\infty }$: Euclidean and $L_{\infty }$-norms in $\mathbb {R}^{d}$;
${{\mathscr{B}}}_{d}(Z,{ r })= \{ Y \in \mathbb {R}^{d}: \| Y-Z \| \leq r\}$: d-dimensional ball of radius r centred at $Z \in \mathbb {R}^{d}$;
${{\mathscr{B}}}_{d}({ r })= {{\mathscr{B}}}_{d}(0,{ r })=\{ Y \in \mathbb {R}^{d}: \| Y \| \leq { r }\}$;
${\mathcal {S}}_{d}(Z,r)= \{ Y \in \mathbb {R}^{d}: \| Y-Z \| = { r } \}$: d-dimensional sphere of radius r centred at $Z \in \mathbb {R}^{d}$;
${\mathcal {C}}_{d}(Z,\delta )=\{ Y \in \mathbb {R}^{d}: \| Y -Z\|_{\infty } \leq { \delta } \}$: d-dimensional cube of side length 2δ centred at Z (it is also the d-dimensional ball in the $L_{\infty }$-norm with radius δ and centre Z);
${\mathcal {C}}_{d}(\delta )=[-\delta ,\delta ]^{d}={\mathcal {C}}_{d}(0,\delta )$;
${\mathcal {C}}_{d}=[-1,1]^{d}={\mathcal {C}}_{d}(1) $.

1.2 Main Problem of Interest

The main problem discussed in the paper is the following problem of covering a cube by n balls. Let ${\mathcal {C}}_{d}=[-1,1]^{d}$ be a d-dimensional cube, Z₁, … , Z_n be some points in $\mathbb {R}^{d}$ and ${{\mathscr{B}}}_{d}(Z_{j},r)$ be the corresponding balls of radius r centred at Z_j(j = 1, … , n). The dimension d, the number of balls n and their radius r could be arbitrary.

We are interested in the problem of choosing the locations of the centres of the balls Z₁, … , Z_n so that the union of the balls $\cup _{j}{{\mathscr{B}}}_{d}(Z_{j},r)$ covers the largest possible proportion of the cube ${\mathcal {C}}_{d}$. That is, we are interested in choosing a design (a collection of points) $\mathbb {Z}_{n}=\{Z_{1}, \ldots , Z_{n}\}$ so that

$$ C_{d}(\mathbb{Z}_{n},r):= \text{vol} ({\mathcal{C}}_{d} \cap {\mathcal{B}}_{d}(\mathbb{Z}_{n},r))/2^{d} $$

(1)

is as large as possible (given n, r and the freedom we are able to use in choosing Z₁, … , Z_n). Here

$$ {\mathcal{B}}_{d}(\mathbb{Z}_{n},r)= \bigcup_{j=1}^{n} {\mathcal{B}}_{d}(Z_j,r) $$

(2)

and $C_{d}(\mathbb {Z}_{n},r)$ is the proportion of the cube ${\mathcal {C}}_{d}$ covered by the balls ${{\mathscr{B}}}_{d}(Z_{j},r)(j=1, \ldots , n)$. If points $Z_{j} \in \mathbb {Z}_{n}$ are random then by $C_{d}(\mathbb {Z}_{n},r)$ we will mean $\mathbb {E}_{\mathbb {Z}_{n}}C_{d}(\mathbb {Z}_{n},r)$ but we are not going to stress this in notation unless it is important.

For a design $\mathbb {Z}_{n}$, its covering radius is defined by CR$(\mathbb {Z}_{n})= \max \limits _{X \in {\mathcal {C}}_{d}} \min \limits _{Z_{j} \in \mathbb {Z}_{n}}\|X-Z_{j}\|$. In computer experiments, covering radius is called minimax-distance criterion (see [5] and [13]); in the theory of low-discrepancy sequences, covering radius is called dispersion (see [8, Ch. 6]).

The problem of optimal covering of a cube by n balls has very high importance for the theory of global optimization and many branches of numerical mathematics. In particular, the n-point designs $\mathbb {Z}_{n}$ with smallest CR provide the following: (a) the n-point min-max optimal quadratures (see [14, Ch.3,Th.1.1]), (b) min-max n-point global optimization methods in the set of all adaptive n-point optimization strategies (see [14, Ch.4,Th.2.1]), and (c) worst-case n-point multi-objective global optimization methods in the set of all adaptive n-point algorithms (see [20]). In all three cases, the class of (objective) functions is the class of Liptshitz functions, where the Liptshitz constant may be unknown. The results (a) and (b) are the celebrated results of A.G.Sukharev obtained in the late nineteen-sixties and (c) is a recent result of A. žilinskas.

If d is not small (say, d > 5) then computation of the covering radius CR$(\mathbb {Z}_{n}) $ for any non-trivial design $\mathbb {Z}_{n}$ is a very difficult computational problem. This explains why the problem of construction of optimal n-point designs with smallest covering radius is notoriously difficult (see, for example, recent surveys [16, 17]).

If $r= \textit {CR}(\mathbb {Z}_{n})$, then $C_{d}(\mathbb {Z}_{n},r)$ defined in (1) is equal to 1, and the whole cube ${\mathcal {C}}_{d}$ gets covered by the balls. However, we are only interested in reaching the values like 0.9, when a large part of the ball is covered. There are two main reasons why we are not interested in reaching the value $C_{d}(\mathbb {Z}_{n},r)= 1$: (a) practical impossibility of making a numerical checking of the full coverage, if d is large enough, and (b) our approximations lose accuracy when $C_{d}(\mathbb {Z}_{n},r)$ closely approaches 1.

If, for a given γ ∈ [0, 1), we have $C_{d}(\mathbb {Z}_{n},r)\geq 1-\gamma $, then the corresponding coverage of ${\mathcal {C}}_{d}$ will be called (1 − γ)-coverage; the corresponding value of r can be called (1 − γ)-covering radius. If γ = 0 then the (1 − γ)-coverage becomes the full coverage and 1-covering radius of $\mathbb {Z}_{n}$ becomes $C_{d}(\mathbb {Z}_{n},r)$. Of course, for any $\mathbb {Z}_{n}=\{Z_{1}, \ldots , Z_{n}\}$ we can reach $C_{d}(\mathbb {Z}_{n},r)= 1$ by means of increasing r. Likewise, for any given r we can reach $C_{d}(\mathbb {Z}_{n},r)= 1$ by sending $n \to \infty $. However, we are not interested in very large values of n and try to get the coverage of the most part of the cube ${\mathcal {C}}_{d}$ with the radius r as small as possible. We will keep in mind the following typical values of d and n: d = 10,20,50; n = 64,128,512,1024. Correspondingly, we will illustrate our results in such scenarios.

1.3 Two Contradictory Criteria and a Compromise

In choosing $\mathbb {Z}_{n}=\{Z_{1}, \ldots , Z_{n}\}$, the following two main criteria must be followed:

(i) the volumes of intersections of the cube ${\mathcal {C}}_{d}$ and each individual ball ${{\mathscr{B}}}_{d}(Z_{j},r)$ are not very small;
(ii) the volumes of intersections ${{\mathscr{B}}}_{d}(Z_{j},r) \cap {{\mathscr{B}}}_{d}(Z_{i},r)$ are small for all i≠j(i,j = 1, … , n).

These two criteria do not agree with each other. Indeed, as shown in Section 2 (see formulas (11)–(14)), the volume of intersection of the ball ${{\mathscr{B}}}_{d}(Z,r)$ and the cube ${\mathcal {C}}_{d}$ is approximately inversely proportional to ∥Z∥ and hence criterion (i) favours Z_j with small norms. However, if at least some of the points Z_j get close to 0, then the distance between these points gets small and, in view of the formulas of Section A1.7, the volumes of intersections ${{\mathscr{B}}}_{d}(Z_{j},r) \cap {{\mathscr{B}}}_{d}(Z_{i},r)$ get large.

This yields that the above two criteria require a compromise in the rule of choosing $\mathbb {Z}_{n}=\{Z_{1}, \ldots , Z_{n}\}$ as the points Z_j should not be too far from 0 but at the same time, not too close. In particular, and this is clearly demonstrated in many examples throughout the paper, the so-called uniformly distributed sequences of points in ${\mathcal {C}}_{d}$, including ‘low-discrepancy sequences’ in ${\mathcal {C}}_{d}$, provide poor covering schemes. This is in a sharp contrast with the asymptotic case $n \to \infty $ (and hence r → 0), when one of the recommendations (see [2, p. 84]) is to choose Z_j’s from a uniformly distributed sequence of points from a set which is slightly larger than ${\mathcal {C}}_{d}$; this is to facilitate covering of the boundary of ${\mathcal {C}}_{d}$, as it is much easier to cover the interior of the cube ${\mathcal {C}}_{d}$ than its boundary.

In our considerations, n is not very large and hence the radius of balls r cannot be small. One of our recommendations for choosing $\mathbb {Z}_{n}=\{Z_{1}, \ldots , Z_{n}\}$ is to choose Z_j’s at random in a cube ${\mathcal {C}}_{d}(\delta )=[-\delta ,\delta ]^{d}$ (with 0 < δ ≤ 1) with components distributed according to a suitable beta-distribution. The optimal value of δ is always smaller than 1 and depends on d and n. If d is small or n is astronomically large, then the optimal value of δ could be close to 1 but in most interesting instances this value is significantly smaller than 1. This implies that the choice δ = 1 (for example, if Z_j’s form a uniformly distributed sequence of points in the whole cube ${\mathcal {C}}_{d}$) often leads to very poor covering schemes, especially when the dimension d is large (see Tables 1, 2 and 3 in discussed in Section 3). More generally, we show that for construction of efficient designs $\mathbb {Z}_{n}=\{Z_{1}, \ldots , Z_{n}\}$, either deterministic or randomized, we have to restrict the norms of the design points Z_j. We will call this principle ‘δ-effect’.

Table 1 Values of r and δ (in parentheses) to achieve 0.9 coverage for d = 10

Full size table

Table 2 Values of r and δ (in parentheses) to achieve 0.9 coverage for d = 20

Full size table

Table 3 Values of r and δ (in parentheses) to achieve 0.9 coverage for d = 50

Full size table

1.4 Covering a Cube by Smaller Cubes and Quantization

In Section 4 we consider the problem of (1 − γ)-coverage of the cube ${\mathcal {C}}_{d}=[-1,1]^{d}$ by smaller cubes (which are $L_{\infty }$-balls). The problem of 1-covering of cube by cubes has attracted a reasonable attention in mathematical literature (see, e.g., [3, 6]). The problem of (1 − γ)-coverage of a cube by cubes happened to be simpler than the main problem of (1 − γ)-coverage of a cube by Euclidean balls and we have managed to derive closed-form expressions for (a) the volume of intersection of two cubes, and (b) (1 − γ) coverage, the probability of covering a random point in ${\mathcal {C}}_{d}$ by n cubes ${\mathcal {C}}_{d}(Z_{i},r)$ for a wide choice of randomized schemes of choosing designs $\mathbb {Z}_{n}=\{Z_{1}, \ldots , Z_{n}\}$. The results of Section 4 show that the δ-effect holds for the problem of coverage of the cube by smaller cubes in the same degree as for the main problem of Section 3 of covering with balls.

Section 5 is devoted to the following problem of quantization also known as the problem of facility location. Let X = (x₁, … , x_d) be uniform on ${\mathcal {C}}_{d}=[-1,1]^{d}$ and $\mathbb {Z}_{n}=\{Z_{1}, \ldots , Z_{n}\}$ be an n-point design. The mean square quantization error is $\theta _{n}=\theta (\mathbb {Z}_{n}):=\mathbb {E}_{X}\underset {i=1, \ldots , n}{\min \limits } \|X-Z_{i}\|^{2}$. In the case where Z₁, … , Z_n are i.i.d. uniform on ${\mathcal {C}}_{d}(\delta )$, we will derive a simple approximation for the expected value of $\theta (\mathbb {Z}_{n})$ and clearly demonstrate the δ-effect. Moreover, we will notice a strong similarity between efficient quantization designs and efficient designs constructed in Section 3.

1.5 Structure of the Paper and Main Results

In Section 2 we derive accurate approximations for the volume of intersection of an arbitrary d-dimensional cube with an arbitrary d-dimensional ball. These formulas will be heavily used in Section 3, which is the main section of the paper dealing with the problem of (1 − γ)-coverage of a cube by n balls. In Section 4 we extend some considerations of Section 3 to the problem of (1 − γ)-coverage of the cube ${\mathcal {C}}_{d}$ by smaller cubes. In Section 5 we argue that there is a strong similarity between efficient quantization designs and efficient designs of Section 3. In Appendix 1, we briefly mention several facts, used in the main part of the paper, related to high-dimensional cubes and balls. In Appendix 2, we prove two simple but very important lemmas about distribution and moments of certain random variables.

Our main contributions in this paper are:

an accurate approximation (18) for the volume of intersection of an arbitrary d-dimensional cube with an arbitrary d-dimensional ball;
an accurate approximation (26) for the expected volume of intersection of the cube ${\mathcal {C}}_{d}$ with n balls with uniform random centres $Z_{j} \in {\mathcal {C}}_{d}(\delta )$;
closed-form expression of Section 4.2 for the expected volume of intersection the cube ${\mathcal {C}}_{d}$ with n cubes with uniform random centres $Z_{j} \in {\mathcal {C}}_{d}(\delta )$;
construction of efficient schemes of quantization and (1 − γ)-coverage of the cube ${\mathcal {C}}_{d}$ by n balls;
large-scale numerical study.

We are preparing an accompanying paper [9] in which we will further explore the topics of Sections 3–5 and also consider the problems of quantization and (1 − γ)-coverage in the whole space $\mathbb {R}^{d}$ and the problem of (1 − γ)-coverage of simplices.

2 Volume of Intersection of a Cube and a Ball

2.1 The Main Quantity of Interest

Consider the following problem. Let us take the cube ${\mathcal {C}}_{d}=[-1,1]^{d}$ of volume $\text {vol}({\mathcal {C}}_{d})=2^{d}$ and a ball $ {{\mathscr{B}}}_{d}(Z,r) = \{Y \in \mathbb {R}^{d}: \| Y-Z \| \leq r\} $ centred at a point $Z=(z_{1}, \ldots , z_{d}) \in \mathbb {R}^{d}$; this point Z could be outside ${\mathcal {C}}_{d}$. Denote the fraction of the cube ${\mathcal {C}}_{d}$ covered by the ball ${{\mathscr{B}}}_{d}(Z, r)$ by

$$ C_{d,Z,{ r }}={\text{vol}({\mathcal{C}}_{d} \cap {\mathcal{B}}_{d}(Z,{ r }))}/2^{d} . $$

(3)

Our aim is to approximate C_d,Z,r for arbitrary d, Z and r. To do this, we shall use CLT (Central Limit Theorem). We will derive a CLT-based normal approximation in Section 2.3 and then, using an asymptotic expansion in the CLT for non-identically distributed r.v., we will improve this normal approximation in Section 2.4. In Section A1.8 we consider a more direct approach for approximating C_d,Z,r based on the use of characteristic functions and the fact that C_d,Z,r is a c.d.f. of ∥U − Z∥, where U = (u₁, … , u_d) is random vector with uniform distribution on ${\mathcal {C}}_{d}$. From this, C_d,Z,r can be expressed through the convolution of one-dimensional c.d.f’s. Using this approach we can evaluate the quantity C_d,Z,r with high accuracy but the calculations are rather time-consuming. Moreover, entirely new computations have to be made for different Z and, therefore, we much prefer the approximation of Section 2.4.

Note that in the special case Z = 0, several approximations for the quantity C_d,0,r have been derived in [15] but their methods cannot be generalized to arbitrary Z.

2.2 A Generalization of the Quantity (3)

In the next sections, we will need another quantity which slightly generalizes (3). Assume that we have the cube ${\mathcal {C}}_{d}(\delta )=[-\delta ,\delta ]^{d}$ of volume $\text {vol}({\mathcal {C}}_{d}(\delta ))=(2\delta )^{d}$, the ball ${{\mathscr{B}}}_{d}(Z^{\prime },{ r^{\prime } })= \{ Y \in \mathbb {R}^{d}: \| Y-Z^{\prime } \| \leq { r^{\prime } }\}$ with a centre at a point $Z^{\prime }=(z_{1}^{\prime }, \ldots , z_{d}^{\prime }) $. Denote the fraction of the cube ${\mathcal {C}}_{d}(\delta )$ covered by the ball ${{\mathscr{B}}}_{d} (Z^{\prime }, r^{\prime })$ by

$$ {C}_{d,Z^{\prime},{ r^{\prime} }}^{(\delta)}={\text{vol}({\mathcal{C}}_{d}(\delta) \cap {\mathcal{B}}_{d(Z^{\prime}},{ r^{\prime} }))}/(2\delta)^d . $$

(4)

Then, by changing the coordinates and the radius

$$ Z=Z^{\prime}/\delta= ({z}_{1}^{\prime}/\delta, \ldots, z_{d}^{\prime}/\delta) \text{and } r= r^{\prime}/\delta , $$

(5)

we obtain

$$ {C}_{d,Z^{\prime},{ r^{\prime} }}^{(\delta)}=C_{d,Z,{ r }} . $$

(6)

2.3 Normal Approximation for the Quantity (3)

Let U = (u₁, … , u_d) be a random vector with uniform distribution on ${\mathcal {C}}_{d}$ so that u₁, … , u_d are i.i.d.r.v. uniformly distributed on [− 1, 1]. Then for given $Z=(z_{1}, \ldots , z_{d}) \in \mathbb {R}^{d}$ and any r > 0,

$$ C_{d,Z,{ r }} = \mathbb{P} \left\{ \| U - Z \| \leq { r } \right\} = \mathbb{P} \left\{ \| U - Z \|^{2} \leq { r^{2} } \right\} = \mathbb{P} \left\{\sum\limits_{j=1}^{d} (u_j - z_j)^{2} \leq { r }^{2} \right\}. $$

That is, C_d,Z,r, as a function of r, is the c.d.f. of the r.v. ∥U − Z∥.

Let u have a uniform distribution on [− 1,1] and |z|≤ 1. In view of Lemma 1 of Appendix 2, the density of the r.v. η_z = (u − z)² is

$$ \varphi_{z(t)} = \left\{ \begin{array}{ll} 1/(2\sqrt{t}) & \text{for} 0<t\leq (1- |z|)^{2}\\ 1/(4\sqrt{t}) & \text{for} (1- |z|)^{2}<t\leq (1+ |z|)^{2} \\ 0 & \text{otherwise} \end{array} \right. $$

(7)

and

$$ \mathbb{E}\eta_{z} =z^{2} +\frac{1}{3}, \text{var}(\eta_{z}) = \frac{4}{3} \left( z^{2} +\frac{1}{15} \right) , {\mu}_{z}^{(3)}= \frac{16}{15} \left( z^{2} +\frac{1}{63} \right) , $$

(8)

where ${\mu }_{z}^{(3)}$ is the third central moment: ${\mu }_{z}^{(3)}=E \left [\eta _{z} - E\eta _{z}\right ]^{3}$.

For |z| > 1, the density of η_z = (u − z)² is

$$ \varphi_{z(t)} = \left\{\begin{array}{ll} 1/(4\sqrt{t}) & \text{for } (1-|z|)^{2}<t\leq (1+|z|)^{2} \\ 0 & \text{otherwise} \end{array} \right. $$

(9)

with expressions (8) for $\mathbb {E}\eta _{z}$, var(η_z) and ${\mu }_{z}^{(3)}$ not changing.

Consider the r.v.

$$ \| U-Z \|^{2} ={\sum}_{i=1}^d \eta_{z_j}={\sum}_{j=1}^{d} (u_j-z_j)^{2} . $$

(10)

From (8), its mean is

$$ \mu_{d,Z}=\mathbb{E}\| U-Z \|^{2} =\|Z\|^{2} +\frac{d}{3} . $$

(11)

Using independence of u₁, … , u_d, we also obtain from (8):

$$ {\sigma}_{d,Z}^{2}=\text{var}(\| U-Z \|^{2} ) = \frac{4}{3} \left( \|Z\|^{2} +\frac{d}{15}\right) $$

(12)

and

$$ {\mu}_{d,Z}^{(3)}= \mathbb{E}\left[\| U-Z \|^{2}- \mu_{d,Z}\right]^{3} = {\sum}_{j=1}^{d} \mu_{z_j}^{(3)} = \frac{16}{15} \left( \|Z\|^{2} +\frac{d}{63}\right) . $$

(13)

If d is large enough then the conditions of the CLT for ∥U − Z∥² are approximately met and the distribution of ∥U − Z∥² is approximately normal with mean μ_d,Z and variance ${\sigma }_{d,Z}^{2}$. That is, we can approximate C_d,Z,r by

$$ C_{d,Z,{ r }} \cong {\varPhi} \left( \frac{{ r }^{2}-\mu_{d,Z}}{{\sigma}_{d,Z}} \right) , $$

(14)

where Φ(⋅) is the c.d.f. of the standard normal distribution:

$$ {\varPhi} (t) = {\int}_{-\infty}^{t} \varphi(v)dv \text{with} \varphi(v)=\frac{1}{\sqrt{2\pi}} e^{-v^{2}/2} . $$

The approximation (14) has acceptable accuracy if C_d,Z,r is not very small; for example, it falls inside a 2σ-confidence interval generated by the standard normal distribution (see Figs. 1a and 1b as examples). Let p_β be the quantile of the standard normal distribution defined by Φ(β) = 1 − p_β; for example, $p_{\beta } \simeq 0.05$ for β = 2. As follows from (11), (12) and the approximation (14), we expect the approximate inequality $C_{d,Z,{ r }} \gtrapprox p_{\beta }$ if

$$ r \geq R_{d,\|Z\|,\beta}=\left[\|Z\|^{2}+d/3 - 2\beta \sqrt{\|Z\|^{2}/3+d/45 }\right]^{1/2} . $$

(15)

In many cases discussed in Section 3, the radius r does not satisfy the inequality (15) with β = 2 and even β = 3 and hence the normal approximation (14) is not satisfactorily accurate; this can be evidenced from Figs. 1a, 1b, 2a, 2b, 3a, 3b, 4a, 4b, 5a, 5b, 6a, 6b, 7a, 7b, 8a and 8b.

In the next section, we improve the approximation (14) by using an Edgeworth-type expansion in the CLT for sums of independent non-identically distributed r.v.

2.4 Improved Normal Approximation

General expansion in the central limit theorem for sums of independent non-identical r.v. has been derived by V. Petrov (see Theorem 7 in Chapter 6 in [10]; see also Proposition 1.5.7 in [12]). The first three terms of this expansion have been specialized by V. Petrov in Section 5.6 in [11]. By using only the first term in this expansion, we obtain the following approximation for the distribution function of ∥U − Z∥²:

$$ P\left( \frac{\| U-Z \|^{2}-\mu_{d,Z}}{\sigma_{d,Z}} \leq x \right) \cong {\varPhi}(x) + \frac{ {\mu}_{d,Z}^{(3)}}{6 \left( {\sigma}_{d,Z}^{2}\right)^{3/2} }(1-x^{2})\varphi(x), $$

leading to the following improved form of (14):

$$ C_{d,Z,{ r }} \cong {\varPhi}(t) + \frac{ \|Z\|^{2}+d/63}{5\sqrt{3} (\|Z\|^{2}+d/15)^{3/2} }(1-t^{2})\varphi(t) , $$

(16)

where

$$ t = t_{d,\|Z\|,{ r }}= \frac{{ r }^{2}-\mu_{d,Z}}{{\sigma}_{d,Z}} = \frac{\sqrt{3}(r^{2}- \|Z\|^{2} -d/3)}{2\sqrt{ \|Z\|^{2} +{d}/{15}} } . $$

(17)

From the viewpoint of Section 3, the range of most important values of t from (17) is − 3 ± 1. For such values of t, the uncorrected normal approximation (14) significantly overestimates the values of C_d,Z,r (see Figs. 1a, 1b, 2a, 2b, 3a, 3b, 4a, 4b, 5a, 5b, 6a, 6b, 7a, 7b, 8a and 8b below). The approximation (16) brings the normal approximation down and makes it much more accurate. The other terms in Petrov’s expansion of [10] and [11] continue to bring the approximation down (in a much slower fashion) so that the approximation (16) still slightly overestimates the true value of C_d,Z,r (at least, in the range of interesting values of t from (17)). However, if d is large enough (say, d ≥ 20) then the approximation (16) is very accurate and no further correction is needed.

A very attractive feature of the approximations (14) and (17) is their dependence on Z through ∥Z∥ only. We could have specialized for our case the next terms in Petrov’s approximation but these terms no longer depend on ∥Z∥ only (this fact can be verified from the formula (46) for the fourth moment of the r.v. ν_z = (z − u)²) and hence the next terms are much more complicated. Moreover, adding one or two extra terms from Petrov’s expansion to the approximation (16) does not fix the problem entirely for all Z and r. Instead, we propose a slight adjustment to the r.h.s of (16) to improve this approximation, especially for small dimensions. Specifically, we suggest the approximation

$$ C_{d,Z,{ r }} \cong {\varPhi}(t) + c_d\frac{\|Z\|^{2}+d/63}{5\sqrt{3} (\|Z\|^{2}+d/15)^{3/2} }(1-t^{2})\varphi(t) , $$

(18)

where c_d = 1 + 3/d if the point Z lies on the diagonal of the cube ${\mathcal {C}}_{d}$ and c_d = 1 + 4/d for a typical (random) point Z. For typical (random) points $Z \in {\mathcal {C}}_{d}$, the values of C_d,Z,r are marginally smaller than for the points on the diagonal of ${\mathcal {C}}_{d}$ having the same norm, but the difference is very small. In addition to the points on the diagonal, there are other special points: the points whose components are all zero except for one. For such points, the values of C_d,Z,r are smaller than for typical points Z with the same norm, especially for small r. Such points, however, are of no value for us as they are not typical and we have never observed in simulations random points that come close to these truly exceptional points.

2.5 Simulation Study

In Figs.1a, 1b, 2a, 2b, 3a, 3b, 4a, 4b, 5a, 5b, 6a, 6b, 7a, 7b, 8a and 8b we demonstrate the accuracy of approximations (14), (16) and (18) for C_d,Z,r in dimensions d = 10,50 for the following locations of Z:

(i) Z = 0, the centre of the cube ${\mathcal {C}}_{d}$;
(ii) $\|Z\|=\sqrt {d}$, with Z being a vertex of the cube ${\mathcal {C}}_{d}$;
(iii) Z lies on a diagonal of ${\mathcal {C}}_{d}$ with |z_j| = λ ≥ 0 for all j = 1, … , d and $\|Z\|=\lambda \sqrt {d}$;
(iv) Z is a random vector uniformly distributed on the sphere ${\mathcal {S}}_{d}(0,v)$ with some v > 0.

There are figures of two types. In the figures of the first type, we plot C_d,Z,r over a wide range of r ensuring that values of C_d,Z,r lie in the whole range [0, 1]. In the figures of the second type, we plot C_d,Z,r over a much smaller range of r with C_d,Z,r lying in the range [0,ε] for some small positive ε such as ε = 0.015. For the purpose of using the approximations of Section 3, we need to assess the accuracy of all approximations for smaller values of C_d,Z,r and hence the second type of plots are often more insightful. In Figs. 1a, 1b, 2a, 2b, 3a, 3b, 4a, 4b, 5a, 5b, 6a, 6b, 7a and 7b the solid black line depicts values of C_d,Z,r computed via Monte Carlo methods, the blue dashed, the red dot-dashed and green long dashed lines display approximations (14), (16) and (18), respectively.

In the case where Z is a random vector uniformly distributed on a sphere ${\mathcal {S}}_{d}(0,v)$, the style of the figures of the second type is slightly changed to adapt for this choice of Z and provide more information for Z which do or do not belong to the cube ${\mathcal {C}}_{d}$. In Figs. 8a and 8b, the thick dashed red lines correspond to random points $Z \in {\mathcal {S}}_{d}(0,v) \cap {\mathcal {C}}_{d}$. The thick dot-dashed orange lines correspond to random points $Z \in {\mathcal {S}}_{d}(0,v)$ such that $Z\not \in {\mathcal {C}}_{d}$. Approximations (14) and (16) are depicted in the same manner as previous figures but the approximation (18) is now represented by a solid green line. The thick solid red line displays values of C_d,Z,r for Z on the diagonal of ${\mathcal {C}}_{d}$ with ∥Z∥ = v with v = 1.5 for d = 10 and v = 1.75 for d = 50.

From the simulations that led to Figs. 1a, 1b, 2a, 2b, 3a, 3b, 4a, 4b, 5a, 5b, 6a, 6b, 7a, 7b, 8a and 8b we can make the following conclusions.

The normal approximation (14) is quite satisfactory unless the value C_d,Z,r is small.
The accuracy of all approximations improves as d grows.
The approximation (18) is very accurate even if the values C_d,Z,r are very small.
If d is large enough then the approximations (16) and (18) are practically identical and are extremely accurate.

3 Covering a Cube by n Balls

In this section, we consider the main problem of covering the cube ${\mathcal {C}}_{d}=[-1,1]^{d}$ by the union of n balls ${{\mathscr{B}}}_{d}(Z_{j},r)$ as formulated in Section 1.2. We will discuss different schemes of choosing the set of ball centres $\mathbb {Z}_{n}=\{Z_{1}, \ldots , Z_{n}\}$ for given d and n. The radius r will then be chosen to achieve the required probability of covering: $C_{d}(\mathbb {Z}_{n},r)\geq 1-\gamma $. Most of the schemes will involve one or several parameters which we will want to choose in an optimal way.

3.1 The Main Covering Scheme

The following will be our main scheme for choosing $\mathbb {Z}_{n}=\{Z_{1}, \ldots , Z_{n}\}$.

Scheme 1. Z₁, … , Z_n are i.i.d. random vectors uniformly distributed in the cube ${\mathcal {C}}_{d}(\delta )=[-\delta ,\delta ]^{d}$, where δ ∈ [0, 1] is a parameter

We will formulate several other covering schemes and compare them with Scheme 1. The reasons why we have chosen Scheme 1 as the main scheme are the following.

It is easier to theoretically investigate than all other non-trivial schemes.
It includes, as a special case when δ = 1, the scheme which is very popular in practice of Monte Carlo [8] and global random search [18, 19] and is believed to be rather efficient (this is not true).
Numerical studies provided below show that Scheme 1 with optimal δ provides coverings which are rather efficient, especially for large d (see Section 3.5 for a discussion regarding this issue).

3.2 Theoretical Investigation of Scheme 1

Let Z₁, … , Z_n be i.i.d. random vectors uniformly distributed in the cube ${\mathcal {C}}_{d}(\delta )$ with 0 < δ ≤ 1. Then, for given $U = (u_{1}, \ldots , u_{d}) \in \mathbb {R}^{d}$,

$$ \begin{array}{@{}rcl@{}} \mathbb{P} \left\{ U \in {\mathcal{B}}_{d}(\mathbb{Z}_{n},r) \right\}&=& 1-\prod\limits_{j=1}^{n} \mathbb{P} \left\{ U \notin {\mathcal{B}}_{d}({Z}_j,r) \right\} \\ &=& 1-\prod\limits_{j=1}^{n}\left( 1-\mathbb{P} \left\{ U \in {\mathcal{B}}_{d}({Z}_j,r) \right\} \right) \\ &=& 1-\left( 1-\mathbb{P}_{_Z} \left\{ \|U - {Z} \| \leq r \right\} \right)^{n} \end{array} $$

(19)

where ${{\mathscr{B}}}_{d}(\mathbb {Z}_{n},r)$ is defined in (2). The main characteristic of interest $C_{d}(\mathbb {Z}_{n},r)$, defined in (1), the proportion of the cube covered by the union of balls ${{\mathscr{B}}}_{d}(\mathbb {Z}_{n},r)$, is simply

$$ C_{d}(\mathbb{Z}_{n},r) = \mathbb{E}_{_U} \mathbb{P} \left\{ U \in {\mathcal{B}}_{d}(\mathbb{Z}_{n},r) \right\} $$

(20)

Continuing (19), note that

$$ \mathbb{P}_{_Z} \left\{ \|U - {Z} \| \leq r \right\} = \mathbb{P}_{_Z} \left\{ \sum\limits_{j=1}^{d} (z_j-u_j)^{2} \leq r^{2} \right\} = {C}_{d,U,{ r }}^{(\delta)} , $$

(21)

where ${C}_{d,U,{ r }}^{(\delta )}$ is defined by the formula (4). From (5) and 6 we have $ \label {eq:inters2ae} {C}_{d,U,{ r }}^{(\delta )}=C_{d,U/\delta ,{ r/\delta }} $ where C_d,U/δ,r/δ is the quantity defined by (3). This quantity can be approximated in a number of different ways as shown in Section 2. We will compare (14), the simplest of the approximations, with the approximation given in (18). Approximation (14) gives

$$ {C}_{d,U,{ r }}^{(\delta)}=C_{d, U/\delta,{ r/ \delta }} \cong {\varPhi} \left( \frac{{ (r/ \delta) }^{2}- \| U \|^{2}/\delta^{2} -d/3 }{2 \sqrt{ \|U\|^{2}/(3\delta^{2}) + d/45} } \right) , $$

(22)

whereas approximation (18) provides

$$ {C}_{d,U,{ r }}^{(\delta)} \cong {\varPhi}(t_{\delta}) + c_d\frac{ \|U\|^{2}/\delta^{2}+d/63}{5\sqrt{3} (\|U\|^{2}/\delta^{2}+d/15)^{3/2} }(1-{t}_{\delta}^{2})\varphi(t_{\delta}) , $$

(23)

with c_d = 1 + 4/d and

$$ t_{\delta} = \frac{(r/\delta)^{2}- \|U\|^{2}/\delta^{2} -d/3}{2\sqrt{\|U\|^{2}/(3\delta^{2}) +d/45 } } . $$

From (37), $\mathbb {E} \| U \|^{2}= d/3$ and var(∥U∥²) = 4d/45. Moreover, if d is large enough then $\| U \|^{2}= {\sum }_{j=1}^{d} {u_{j}^{2}}$ is approximately normal.

We shall simplify the expression (19) by using the approximation

$$ (1-t)^{n} \simeq e^{-nt} , $$

(24)

which is a good approximation for small values of t and moderate values of nt; this agrees with the ranges of d, n and r we are interested in.

We can combine the expressions (20) and (19) with approximations (22), (23) and (24) as well as with the normal approximation for the distribution of ∥U∥², to arrive at two final approximations for $C_{d}(\mathbb {Z}_{n},r)$ that differ in complexity. If the original normal approximation of (22) is used then we obtain

$$ C_{d}(\mathbb{Z}_{n},r) \simeq 1- {\int}_{-\infty}^{\infty} \psi_1(s) \varphi(s)d s, $$

(25)

with

$$ \psi_1(s)=\exp \left\{-n {\varPhi}(c_{s}) \right\} , \ c_{s}= \frac{{3 (r/\delta) }^{2}- s^{\prime} -d }{2 \sqrt{ s^{\prime} + d/5} } , s^{\prime}=(d+2s \sqrt{d/5})/\delta^{2} . $$

If approximation (23) is used, we obtain:

$$ C_d(\mathbb{Z}_n,r) \simeq 1- {\int}_{-\infty}^{\infty} \psi_2(s) \varphi(s)d s, $$

(26)

with

$$ \psi_{2}(s)=\exp \left\{-n \left( {\varPhi}(c_{s}) +\left( 1+\frac{4}{d}\right) \frac{s^{\prime}+d/21}{5[ s^{\prime} +d/5 ]^{3/2}}(1-{c}_s^{2})\varphi(c_{s}) \right) \right\} . $$

3.3 Simulation Study for Assessing Accuracy of Approximations (25) and (26)

In Figs. 9a, 9b, 10a, 10b, 11a and 11b, $C_{d}(\mathbb {Z}_{n},r) $ is represented by a solid black line and has been obtained via Monte Carlo methods. Approximation (25) is indicated by a dashed blue line and approximation (26) is represented by long dashed green lines. All figures demonstrate that approximation (26) is extremely accurate across different dimensions and values of n. This approximation is much superior to approximation (25).

3.4 Other Schemes

In addition to Scheme 1, we have also considered the following schemes for choosing $\mathbb {Z}_{n}=\{Z_{1}, \ldots , Z_{n}\}$.

Scheme 2. Z₁ = 0; Z₂, … , Z_n are i.i.d. random vectors uniformly distributed in the cube ${\mathcal {C}}_{d}(\delta )=[-\delta ,\delta ]^{d}$
Scheme 3. Z₁, … , Z_n are taken from the minimum-aberration fractional factorial design on vertices of the cube ${\mathcal {C}}_{d}(\delta )=[-\delta ,\delta ]^{d}$
Scheme 4. Z₁, … , Z_n are i.i.d. random vectors on ${\mathcal {C}}_{d}(\delta )$ with independent components distributed according to beta-distribution with density (34) with some α > 0
Scheme 5. Z₁, … , Z_n are i.i.d. random vectors uniformly distributed in the ball ${{\mathscr{B}}}_{d}(\delta )$
Scheme 6. Z₁, … , Z_n are i.i.d. random vectors uniformly distributed on the sphere ${\mathcal {S}}_{d}(\delta )$
Scheme 7. Z₁, … , Z_n are taken from a low-discrepancy Sobol’s sequence on the cube ${\mathcal {C}}_{d}(\delta )$

The rationale behind the choice of these schemes is as follows. By studying Scheme 2, we test the importance of inclusion of 0 into $\mathbb {Z}_{n}$. We propositioned that if we included 0 into $\mathbb {Z}_{n}$, the optimal value of δ may increase for some of the schemes making them more efficient; this effect has not been detected.

Scheme 3 with optimal δ is an obvious candidate for being the most efficient. Unlike all other schemes considered, Scheme 3 is only defined for the values of n of the form n = 2^k with k ≤ d.

By using Scheme 4, we test the possibility of improving Scheme 1 by changing the distribution of points in the cube ${\mathcal {C}}_{d}(\delta )$. We have found that the effect of distribution is strong and smaller values of α lead to more efficient covering schemes. By choosing α small enough, like α = 0.1, we achieve the average efficiency of the covering schemes which is rather close to the efficiency of Scheme 3. Tables 1, 2 and 3 contain results obtained for Scheme 4 with α = 0.5 and α = 1.5; if α = 1 then Scheme 4 becomes Scheme 1.

From Section A1.4, we know that for constructing efficient designs we have to somehow restrict the norms of Z_j’s. In Schemes 5 and 6, we are trying to do this in an alternative way to Schemes 1 and 4.

Scheme 7 is a natural improvement of Scheme 1. As a particular case with δ = 1, it contains one of the best known low-discrepancy sequences and hence Scheme 7 with δ = 1 serves as the main benchmark with which we compare other schemes. For construction, we have used the R-implementation of the Sobol’s sequences; it is based on [4].

For all the schemes excluding Scheme 3, the sequences $\mathbb {Z}_{n}=\{Z_{1}, \ldots , Z_{n}\}$ are nested so that $\mathbb {Z}_{n} \subset \mathbb {Z}_{m} $ for all n < m; using the terminology of [6], these schemes provide on-line coverings of the cube. Note that for the chosen values of n, Scheme 7 also has some advantage over other schemes considered. Indeed, despite Sobol’s sequences are nested, the values n of the form n = 2^k are special for the Sobol’s sequences and for such values of n the Sobol’s sequences possess extra uniformity properties that they do not possess for other values of n.

3.5 Numerical Comparison of Schemes

In Tables 1, 2 and 3, for Schemes 1, 2, 4, 5 and 6, we present the smallest values of r required to achieve an 0.9-coverage on average. For these schemes, the value inside the parentheses shows the average value of δ required to obtain 0.9-coverage. For Schemes 3 and 7, we give the smallest value of r needed for a 0.9-coverage. For these two schemes, the value within the bracket corresponds to the (non random) value of δ with which we attain such a coverage.

In Figs. 12a, 12b, 13a, 13b, 14a, 14b, 15a and 15b we plot $C_{d}(\mathbb {Z}_{n},r) $ as a functions of δ ∈ [0, 1] across a number schemes, n and d. For these plots we have used the values of r provided in Tables 1, 2 and 3 such that for Figs. 12a, 12b, 13a and 13b which correspond to Scheme 1 and Scheme 2, the maximum coverage is very close to 0.9 and the optimal δ is very close to the values presented in Tables 1, 2 and3. For Figs. 12a, 12b, 13a, 13b, 14a, 14b, 15a and 15b the maximum coverage 0.9 is attained with δ provided in Tables 1, 2 and 3. In Figs. 12a, 12b, 13a, 13b, 14a, 14b, 15a and 15b the solid green line, long dashed red line, dashed blue line and dot dashed orange line correspond to n = 64,128,512,and 1024 respectively. The vertical lines on these plots indicate the value of δ where the maximum coverage is obtained.

From Tables 1, 2 and 3 and Figs.12a, 12b, 13a, 13b, 14a, 14b, 15a and 15b we arrive at the following conclusions:

the δ-effect is very important and getting much stronger as d increases;
coverage of unadjusted low-discrepancy sequences is extremely low;
properly δ-tuned deterministic Scheme 3, which uses fractional factorial designs of minimum abberation, provides excellent covering;
randomized Scheme 4 with suitably chosen parameters of the beta-distribution, also provides very high quality coverage (on average);
for all schemes considered, the coverings with the optimal values of δ fully comply with the result of Section A1.4 describing the area of volume concentration in the cube ${\mathcal {C}}_{d}$.

4 Covering a Cube by Cubes

4.1 Volume of Intersection of Two Cubes

Let us take two cubes: ${\mathcal {C}}_{d}=[-1,1]^{d}$ and ${\mathcal {C}}_{d}(Z,r) = \{ Y \in \mathbb {R}^{d}: \| Y - Z\|_{\infty } \leq { r } \}$, a cube of side length 2r centred at a point $Z =(z_{1}, \ldots , z_{d}) \in {\mathcal {C}}_{d}$. Denote the fraction of the cube ${\mathcal {C}}_{d}$ covered by ${\mathcal {C}}_{d}(Z,r)$ by

$$ \begin{array}{@{}rcl@{}} F_{d,Z,r}={\text{vol}({\mathcal{C}}_{d} \cap {\mathcal{C}}_d(Z,{ r }))}/2^{d} . \end{array} $$

(27)

Let, like in Section 2.3, U = (u₁, … , u_d) be a random vector with uniform distribution on ${\mathcal {C}}_{d}$ so that u₁, … , u_d are i.i.d.r.v. uniformly distributed on [− 1,1]. Then

$$ F_{d,Z,{ r }}= \mathbb{P} \left\{ \| U-Z \|_{\infty} \leq { r } \right\}= \mathbb{P} \left\{\max_{1\leq j \leq d} |u_j-z_j| \leq r \right\} . $$

That is, F_d,Z,r, as a function of r, is the c.d.f. of the r.v. $\| U-Z \|_{\infty } = \max \limits _{1\leq j \leq d} |u_{j}-z_{j}|$.

From Lemma 2 of Appendix 2 the c.d.f. of the r.v. |u_j − z_j| is

$$ \begin{array}{@{}rcl@{}} G_{d,z_j}(t) = \mathbb{P}\{|u_j - z_j| \leq t \}= \left\{\begin{array}{cl} 0 & \text{for} t\leq 0\\ t & \text{for} 0<t< 1- |z_j|\\ \frac{1}{2}({1 + t - |z_j|}) & \text{for} 1- |z_j| \leq t\leq 1+ |z_j| \\ 1 & 1+ |z_j| < t . \end{array} \right. \end{array} $$

Since the c.d.f. of a maximum of independent r.v. is the product of marginal c.d.f.’s, we obtain

$$ F_{d,Z,{ r }}= {\prod}_{j=1}^{d} G_{d,z_{j}}(r) . $$

Two extreme particular cases of location of Z are:

(i) Z = 0: F_d,0,r = r^d, 0 ≤ r ≤ 1;
(ii) $\|Z\|=\sqrt {d}$, when Z being a vertex of the cube ${\mathcal {C}}_{d}$: F_d,V,r = (r/2)^d, 0 ≤ r ≤ 2.

Assume now that we have the cube ${\mathcal {C}}_{d}(\delta )=[-\delta ,\delta ]^{d}$ of volume (2δ)^d and another cube $ {\mathcal {C}}_{d}(Z^{\prime },{ r^{\prime } })= \{ Y \in \mathbb {R}^{d}: \| Y-Z^{\prime } \|_{\infty } \leq r^{\prime }\} $ with a centre at a point $Z^{\prime }=(z_{1}^{\prime }, \ldots , z_{d}^{\prime }) $. Denote the fraction of the cube ${\mathcal {C}}_{d}(\delta )$ covered by ${\mathcal {C}}_{d}(Z^{\prime }, r^{\prime })$ by

$$ {F}_{d,Z^{\prime},{ r^{\prime} }}^{(\delta)}={\text{vol}({\mathcal{C}}_{d}(\delta) \cap {\mathcal{C}}_d(Z^{\prime},{ r^{\prime} }))}/(2\delta)^d . $$

Then by changing the coordinates and the radius using (5) we get ${F}_{d,Z^{\prime },{ r^{\prime } }}^{(\delta )}=F_{d,Z/\delta ,{ r/\delta }} . $

4.2 Proportion of a Cube Covered by Smaller Cubes with Random Centres

Let us take the cube ${\mathcal {C}}_{d}=[-1,1]^{d}$ and n smaller cubes ${\mathcal {C}}_{d}(Z_{j},{ r })= \{ Y \in \mathbb {R}^{d}: \| Y-Z_{j} \|_{\infty } \leq { r } \}$ with centres at points $Z_{j} \in \mathbb {R}^{d}$. Denote the fraction of the cube ${\mathcal {C}}_{d}$ covered by ${\mathcal {C}}_{d}(\mathbb {Z}_{n},{ r })= \cup _{j=1}^{n} {\mathcal {C}}_{d}(Z_{j}, {r })$, the union of these cubes, by

$$ C_{d,\mathbb{Z}_{n},{ r }}={\text{vol}({\mathcal{C}}_{d} \cap {\mathcal{C}}_d(\mathbb{Z}_{n},{ r }))}/2^{d} . $$

Our aim is to obtain a closed form expression for this quantity for arbitrary d,r and n in the case when Z₁, … , Z_n are i.i.d. random vectors uniformly distributed in the cube ${\mathcal {C}}_{d}(\delta )=[-\delta ,\delta ]^{d}$ with 0 < δ ≤ 1.

Similarly to the combination of (19) with (21), for a given $U = (u_{1}, \ldots , u_{d}) \in \mathbb {R}^{d}$,

$$ \mathbb{P} \left\{ U \in {\mathcal{C}}_d(\mathbb{Z}_{n},r) \right\}= 1-\left( 1-F_{d, U/\delta,{ r/\delta }} \right)^{n} . $$

Similarly to (20),

$$ \begin{array}{@{}rcl@{}} C_{d,\mathbb{Z}_{n},{ r }} = \mathbb{E}_{_U} \mathbb{P} \left\{ U \in {\mathcal{C}}_d(\mathbb{Z}_{n},r) \right\} = 1- \mathbb{E}_{_U} \left( 1-F_{d, U/\delta,{ r/\delta }} \right)^{n}. \end{array} $$

For an integer k, set

$$ I_k = \frac{1}{2} {\int}_{-1}^{1} \left[G_{d,u/\delta}(r/\delta) \right]^{k}du . $$

(28)

Then, using the binomial theorem, we have

$$ C_{d,\mathbb{Z}_{n},{ r }} = 1 - {\sum}_{k=0}^{n}(-1)^{k} \binom{n}{k} I_k^d . $$

(29)

It is possible to evaluate (28) explicitly. For k = 0 and for r ≥ δ + 1, we clearly have I_k = 1. For k ≥ 1 and 0 ≤ r ≤ δ + 1, the integral I_k takes different forms depending on the values of r and δ:

$$ I_k= \left\{ \begin{array}{cl} (\delta-r) \left( \frac{r}{\delta}\right)^{k} - \frac{2\delta}{(k+1)} \left\{\left( \frac{\delta+r-1}{2\delta} \right)^{k+1} - \left( \frac{r}{\delta} \right)^{k+1} \right \} & \text{for}\ r\leq \delta \\ &\\ (r-\delta) - \frac{2\delta}{(k+1)} \left\{ \left( \frac{\delta+r-1}{2\delta} \right)^{k+1} - 1 \right \} & \text{for}\ 0 \leq r - \delta \leq 1, r+\delta \geq1 \\ &\\ (r-\delta) + {2\delta }{/(k+1)} & \text{for}\ 0 \leq r - \delta \leq 1, r+\delta \leq 1. \end{array} \right. $$

In Figs. 16a and 16b, we depict values of $C_{d,\mathbb {Z}_{n},r}$ (computed using (29)) as a function of δ for a number of choices of r. As in Section 3.5, we note that the δ-effect holds for the problem of coverage of the cube by smaller cubes.

5 Quantization

In this section, we briefly consider the following problem of quantization also known as the problem of facility location. Let X = (x₁, … , x_d) be uniform on ${\mathcal {C}}_{d}=[-1,1]^{d}$ and $\mathbb {Z}_{n}=\{Z_{1}, \ldots , Z_{n}\}$ be an n-point design. The mean square quantization error is $\theta _{n}=\theta (\mathbb {Z}_{n})=\mathbb {E}_{X}\underset {i=1, \ldots , n}{\min \limits } \|X-Z_{i}\|^{2}$. In the case where Z₁, … , Z_n are i.i.d. uniform on ${\mathcal {C}}_{d}(\delta )$, we will derive a simple approximation for the expected value of $\theta (\mathbb {Z}_{n})$ in order to demonstrate the δ-effect. We shall also notice a strong correlation in design efficiency used for quantization and for (1 − γ)-covering as studied in Section 3.

The two characteristics, $C_{d}(\mathbb {Z}_{n},r)$ and $ \theta (\mathbb {Z}_{n})$, are related as follows. $\mathbb {E}_{\mathbb {Z}_{n}} C_{d}(\mathbb {Z}_{n},r)$, as a function of r ≥ 0, is the c.d.f. of the r.v. $\mathbb {E}_{\mathbb {Z}_{n}}\min \limits _{i=1, \ldots , n} \|X-Z_{i}\|$, where X is uniform on [− 1,1]^d, while $\mathbb {E}_{\mathbb {Z}_{n}}\theta (\mathbb {Z}_{n})$ is the second moment of the distribution with this c.d.f.:

$$ \mathbb{E}_{\mathbb{Z}_{n}} \theta(\mathbb{Z}_{n})= \mathbb{E}_{\mathbb{Z}_{n}} \mathbb{E}_{X}\underset{i=1, \ldots, n}{\min} \|X-Z_{i}\|^{2} = \mathbb{E}_{X} \mathbb{E}_{\mathbb{Z}_{n}} \underset{i=1, \ldots, n}{\min} \|X-Z_{i}\|^{2} = {\int}_{r\geq 0} r^{2} d \mathbb{E}_{\mathbb{Z}_{n}}C_{d}(\mathbb{Z}_{n},r) . $$

(30)

Using approximation (26) we obtain

$$ \begin{array}{@{}rcl@{}} \frac{d}{dr}(\mathbb{E}_{\mathbb{Z}_{n} } C_{d}(\mathbb{Z}_{n},r)) \cong f_{\delta}(r) &:=& \frac{n \cdot r}{\delta} {\int}_{-\infty}^{\infty} \frac{\varphi(s)\varphi(c_{s}) \psi_{2}(s)}{\sqrt{s^{\prime}+k}} \times \\ &\times& \left [\sqrt{3}+\left( 1+\frac{4}{d} \right) \frac{ \left( s^{\prime}+\frac{d\delta^{2}}{63} \right)}{ 5\left (s^{\prime}+k\right)^{3/2} } \left\{{\delta(c_{s}^{3}-c_{s})} - \frac{\sqrt{3}(r^{2}-\frac{d\delta^{2}}{3}-s^{\prime})}{ \sqrt{s^{\prime}+k}} \right\} \right] ds \end{array} $$

with

$$ \begin{array}{@{}rcl@{}} \psi_{2}(s)=\exp \left\{-n \left( {\varPhi}(c_{s}) + \left( 1+\frac{4}{d} \right)\frac{ \delta \left[ s^{\prime}+\frac{d\delta^{2}}{63} \right]}{ 5\sqrt{3}\left [s^{\prime}+k\right]^{3/2} }(1-c_{s}^{2})\phi(c_{s}) \right) \right\} \end{array} $$

and

$$ c_{s}= \frac{\sqrt{3}\left( r^{2} -s^{\prime} - \frac{d\delta^{2}}{3} \right)}{2\delta\sqrt{ s^{\prime}+k }}, s^{\prime}=s\sqrt{\frac{4d}{45}}+d/3 , k = \frac{d\delta^{2}}{15} . $$

Therefore using relation (30) the approximation for $\mathbb {E}_{\mathbb {Z}_{n}} \theta (\mathbb {Z}_{n})$ for Scheme 1 is:

$$ \mathbb{E} \theta_n=\mathbb{E}_{\mathbb{Z}_{n}} \theta(\mathbb{Z}_{n}) \cong {\int}_{r\geq 0} r^{2} f_{\delta}(r) dr . $$

(31)

In Figs. 17a, 17b, 18a and 18b, we asses the accuracy of the approximation (31). In these figures, the solid black line corresponds to $\mathbb {E} \theta _{n} $ obtained via Monte Carlo methods and the dashed red line depicts the approximation. We see that the accuracy of approximation (31) is very high for small n and large d. However, as Fig. 17b shows, if d is not large enough but n is large then the errors accumulate and the resulting approximation may not be accurate enough.

As follows from results of [8, Ch.6], for efficient covering schemes the order of convergence of the covering radius to 0 as $n \to \infty $ is n^− 1/d. Therefore, for the mean squared distance (which is the quantization error) we should expect the order n^− 2/d as $n \to \infty $. Therefore, for sake of comparison of quantization errors θ_n across n we renormalize this error from $\mathbb {E} \theta _{n}$ to $n^{2/d} \mathbb {E} \theta _{n}$.

In Tables 4, 5 and 6, we present the minimum value of $n^{2/d} \mathbb {E} \theta _{n}$ for a selection of the schemes among those considered in Section 3. In these tables, the value within the parentheses corresponds to the value of δ where the minimum of $n^{2/d} \mathbb {E} \theta _{n}$ was obtained. For Scheme 3, typical behaviour of $\mathbb {E} \theta _{n}$ across δ for a number and n and d is presented in Figs. 19a, 19b, 20a and 20b.

Table 4 Minimum value of $n^{2/d} \mathbb {E} \theta _{n}$ and δ (in parentheses) across schemes and n for d = 10

Full size table

Table 5 Minimum value of $n^{2/d} \mathbb {E} \theta _{n}$ and δ (in parentheses) across schemes and n for d = 20

Full size table

Table 6 Minimum value of $n^{2/d} \mathbb {E} \theta _{n}$ and δ (in parentheses) across schemes and n for d = 50.

Full size table

We make the following two main conclusions from analysing results of this numerical study:

(a) the presence of a strong δ-effect, very similar to the effect observed in Section 3, and
(b) for a given design $\mathbb {Z}_{n}$, there is a very strong correlation between the covering probability as studied in Section 3 and the normalized quantization error $n^{2/d}\mathbb {E} \theta (\mathbb {Z}_{n})$.

By comparing the values of δ in Tables 4, 5 and 6 with those in Tables 1, 2 and 3, we see a strong similarity between efficient quantization schemes and efficient covering schemes.

References

Blum A, Hopcroft J, Kannan R (2020) Foundations of data science. Cambridge University Press, Cambridge
Book Google Scholar
Janson S (1986) Random coverings in several dimensions. Acta Mathematica 156:83–118
Article Google Scholar
Januszewski J, Lassak M (1994) On-line covering the unit cube by cubes. Discrete Comput Geom 12(4):433–438
Article Google Scholar
Joe S, Kuo FY (2008) Constructing Sobol sequences with better two-dimensional projections. SIAM J Sci Comput 30(5):2635–2654
Article Google Scholar
Johnson ME, Moore LM, Ylvisaker D (1990) Minimax and maximin distance designs. J Stat Plan Infer 26(2):131–148
Article Google Scholar
Kuperberg W (1994) On-line covering a cube by a sequence of cubes. Discrete Comput Geom 12(1):83–90
Article Google Scholar
Li S (2011) Concise formulas for the area and volume of a hyperspherical cap. Asian J Math Stat 4(1):66–70
Article Google Scholar
Niederreiter H (1992) Random number generation and quasi-Monte Carlo methods. SIAM, Philadelphia
Book Google Scholar
Noonan J, Zhigljavsky A (2021) Non-lattice covering and quantization in high dimensions. In: Black Box Optimization, Machine Learning and No-Free Lunch Theorems. Springer
Petrov VV (1975) Sums of independent random variables. Springer, Berlin
Book Google Scholar
Petrov VV (1995) Limit theorems of probability theory: sequences of independent random variables. Oxford Science, Oxford
Google Scholar
Prakasa Rao BLS (1987) Asymptotic theory of statistical inference. Wiley, New York
Google Scholar
Pronzato L, Müller W. G. (2012) Design of computer experiments: space filling and beyond. Stat Comput 22(3):681–701
Article Google Scholar
Sukharev A (1992) Minimax models in the theory of numerical methods. Springer Science & Business Media, Berlin
Book Google Scholar
Tibken B, Constales D, et al. (1997) The volume of the intersection of a concentric cube and ball in n-dimensional space: collection of approximations. In: SIAM Review, vol 39, pp 783–786
Tóth GF (2017) Packing and covering Handbook of discrete and computational geometry. Chapman and Hall, London, pp 27–66
Tóth GF, Kuperberg W (1993) Packing and covering with convex sets. In: Handbook of Convex Geometry. Elsevier, pp 799–860
Zhigljavsky A (1991) Theory of global random search. Kluwer, Norwell
Book Google Scholar
Zhigljavsky A, Zilinskas A (2007) Stochastic global optimization. Springer Science & Business Media, Berlin
Google Scholar
žilinskas A (2013) On the worst-case optimal multi-objective global optimization. Optim Lett 7(8):1921–1928
Article Google Scholar

Download references

Funding

Jack Noonan was financially supported by the EPSRC grant number EP/N509449/1.

Author information

Authors and Affiliations

School of Mathematics, Cardiff University, Cardiff, CF244AG, UK
Anatoly Zhigljavsky & Jack Noonan

Authors

Anatoly Zhigljavsky
View author publications
You can also search for this author in PubMed Google Scholar
Jack Noonan
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Anatoly Zhigljavsky.

Ethics declarations

Conflict of Interest

The authors declare that they have no conflict of interest.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Appendices

Appendix 1. Several Facts about d-Dimensional Balls and Cubes

In this appendix, we briefly mention several facts, used in the main part of the paper, related to high-dimensional cubes and balls. Many of these facts are somewhat counter-intuitive and often lead to creation of wrong heuristics in multivariate optimization and misunderstanding of the behaviour of even simple algorithms in high-dimensional spaces. For more details concerning the material of Sections A1.1–A1.4, see [1].

1.1 A1.1. Volume of the Ball

The volume of the ball ${{\mathscr{B}}}_{d}({ r })= \{ x \in \mathbb {R}^{d}: \| x \| \leq { r }\} $ can be computed by the formula

$$ {\text{vol}({\mathcal{B}}_{d}(r))= { r }^{d} V_{d}, where} V_{d}= \text{vol}({\mathcal{B}}_{d}(1))= \frac{ \pi^{d/2} }{ {\varGamma} (d/2+1)} . $$

(32)

The volumes V_d decrease very fast as d grows. For example, $V_{100}\simeq 2.368 \cdot 10^{-40}$. As $ d \to \infty $,

$$ V_{d}^{1/d} \simeq {\sqrt {2\pi e}} \frac{1}{\sqrt{d}} +O \left( \frac {\log d}{{d}^{3/2}} \right) . $$

(33)

1.2 A1.2. Radius of the Ball of Unit Volume

Define r_d by $\text {vol} ({{\mathscr{B}}}_{d}({ r }_{d}))=1$. Table 7 gives approximate values of r_d.

Table 7 Radius of the ball of unit volume for different dimensions

Full size table

From (33), for large d we have

$$ r_d=\frac {\sqrt{d}}{\sqrt {2\pi e}} +O \left( \frac{1}{\sqrt{d}} \right) $$

where $1/{\sqrt {2\pi e}} \simeq 0.242$. This is only about twice smaller than $\sqrt {d}/2$, the length of the half-diagonal of the d-dimensional unit cube [0, 1]^d.

For r_d,2δ defined by $\text {vol} ({{\mathscr{B}}}_{d}(r_{d,2 \delta }))= \text {vol} ({\mathcal {C}}_{d}(\delta ) ) = (2 \delta )^{d}$, we have r_d,2δ = 2δr_d.

1.3 A1.3. Almost All the Volume is Near the Boundary

First, consider the cube ${\mathcal {C}}_{d}(\delta )=[-\delta ,\delta ]^{d}$, with 0 < δ < 1, as interior to the cube ${\mathcal {C}}_{d}=[-1,1]^{d}$. For the ratio of the volumes of these two cubes, we have ${\text {vol}({\mathcal {C}}_{d}(\delta ))} /{\text {vol}({\mathcal {C}}_{d})}= \delta ^{d} $ which tends to 0 (as $d \to \infty $) exponentially fast for any δ ∈ (0, 1).

If, as $d \to \infty $, δ changes getting closer to 1 but 1 − δ tends to 0 slower than 1/d, then the ratio of the two volumes still tends to 0. In particular, if 1 − δ = c/d^1−δ with 0 < δ < 1 then

$$ \frac{\text{vol}({\mathcal{C}}_{d}(\delta))} {\text{vol}({\mathcal{C}}_{d})}= \delta^d \simeq \exp\{- c d^{1-\delta} \} \to 0 , d \to \infty . $$

Consider now the balls ${{\mathscr{B}}}_{d}(1)$ and ${{\mathscr{B}}}_{d}(1-\epsilon )$. The difference ${{\mathscr{B}}}_{d}(1)\setminus {{\mathscr{B}}}_{d}(1-\epsilon )$ is called the annulus. Using (32) we can compute the ratio of volume of this annulus to the volume of the unit ball:

$$ \frac{\text{vol}\left[{\mathcal{B}}_{d}(1)\setminus {\mathcal{B}}_{d}(1-\epsilon)\right]}{\text{vol}({\mathcal{B}}_{d}(1))}= 1-\varepsilon^d . $$

This ratio tends to 1 exponentially fast as $d \to \infty $. The ratio of volume of the ball ${{\mathscr{B}}}_{d}(1-\epsilon )$ to the volume of the unit ball ${{\mathscr{B}}}_{d}(1)$ is, similarly to the case of the cubes above, (1 − ε)^d. This result extends to any measurable set $A \subset \mathbb {R}^{d}$. Indeed, define the set A_1−ε = {(1 − ε)x : x ∈ A}. Then, by splitting A and A_1−ε into infinitesimal cubes and adding up their volumes, we find vol(A_1−ε) = (1 − ε)^dvol(A).

1.4 A1.4. The Area of Volume Concentration in a Cube

Let X = (x₁, … , x_d) be uniformly distributed on ${\mathcal {C}}_{d} =[-1,1]^{d}$. Then ${x_{1}^{2}}, \ldots , {x_{d}^{2}}$ are independent r.v. on [0, 1]. The Hoeffding’s inequality gives

$$ \mathbb{P} \left\{ \left| \frac{1}{d}\left( x_1^{2}+ \ldots+ x_d^{2}\right)- \frac{1}{d}\mathbb{E}\left( x^{2}_1+ \ldots+ x^{2}_d \right) \right| \geq \epsilon \right\} \leq 2 e^{-2d\epsilon^{2}} . $$

Since $\mathbb {E} {{x}^{2}_{i}}= \frac {1}{3}$, we obtain

$$ \mathbb{P} \left\{ \left| \|X\|^{2} -\frac{d}{3}\right| \geq \epsilon d \right\} \leq 2 e^{-2d\epsilon^{2} } . $$

Therefore, the main volume in the cube ${\mathcal {C}}_{d}$ is concentrated in the annulus around the sphere with radius $\sqrt {d/3}$.

1.5 A1.5. Squared Norm of a Random Point in a Cube

Let Z = (z₁, … , z_d) be a random vectors on ${\mathcal {C}}_{d}(\delta )=[-\delta ,\delta ]^{d}$ consisting of i.i.d. random components z_i having a distribution with density p(t), t ∈ [−δ,δ], δ > 0.

Set $\eta ={\sum }_{j=1}^{d} {z_{j}^{2}}$. We have $\mathbb {E} \eta = d \mu _{2} $ and $\text {var}(\eta )=d \text {var}({{z}_{1}^{2}})=d(\mu _{4}-{{\mu }_{2}^{2}})$, where μ_j be the moments of the distribution with density p(t).

For example, when z_i have Beta(α,α) distribution with density

$$ p_{\alpha,\delta}(t)= \frac{(2\delta)^{1-2\alpha}}{{\text{Beta}(\alpha,\alpha)}} [\delta^{2}-t^{2}]^{\alpha-1} , -\delta<t<\delta ,\alpha>0, $$

(34)

where Beta(⋅,⋅) is the beta-function, then

$$ \mu_{2}=\frac{\delta^{2}}{2\alpha+1} , \mu_4=\frac{3 \delta^{4}}{(2\alpha+1)(2\alpha+3)} $$

(35)

and therefore

$$ \mathbb{E} \eta= \frac{d\delta^{2}}{2\alpha+1} , \text{var}(\eta)= \frac{4d \delta^{4} \alpha}{(2\alpha+1)^{2}(2\alpha+3)} . $$

(36)

If α = 1, when Z is uniform in the cube ${\mathcal {C}}_{d}(\delta )$, then

$$ \mathbb{E} \eta= \frac{1}{3} d\delta^{2} , \text{var}(\theta)= \frac{4}{45} d \delta^{4} . $$

(37)

1.6 A1.6. Distance Between Two Random Points in a Cube

Assume Z = (z₁, … , z_d) and $Z^{\prime }=(z^{\prime }_{1}, \ldots , z^{\prime }_{d})$ are independent random vectors on ${\mathcal {C}}_{d}(\delta )=[-\delta ,\delta ]^{d}$ consisting of i.i.d. random components z_i and $z^{\prime }_{i}$ which have some distribution with density p(t), t ∈ [−δ,δ], δ > 0. Let μ_j be the moments of the distribution with density p(t). Assume that the density p(t) is symmetric around 0 and hence all odd moments are zero: μ_2k+ 1 = 0 for k = 1, 2,…

The distribution of the squared distances

$$ \theta=\|Z-Z^{\prime}\|^{2} = \sum\limits_{i=1}^{d} (z_{i}-z^{\prime}_{i})^{2} $$

has the mean and variance that can be easily computed as follows:

$$ \begin{array}{@{}rcl@{}} \mathbb{E} \theta= d \mathbb{E}(z_1-{z}_{1}^{\prime})^{2} &=& 2d \mu_{2} ,\\ \text{var}(\theta)= d \text{var}(z_1-{z}_{1}^{\prime})^{2}& =& d \left[[\mathbb{E}(z_1-{z}_{1}^{\prime})^4-[\mathbb{E}(z_1-{z}_{1}^{\prime})^{2})]^{2} \right]= 2d \left[\mu_4+ \mu_{2}^{2} \right] \end{array} $$

For example, when z_i and $z^{\prime }_{i}$ have Beta(α,α) distribution with density (34) and hence moments (35), we obtain

$$ \mathbb{E} \theta= \frac{2d\delta^{2}}{2\alpha+1} , \text{var}(\theta)= \frac{4d \delta^{4} (4\alpha+3)}{(2\alpha+1)^{2}(2\alpha+3)} . $$

(38)

If α = 1 (that is, when Z and $Z^{\prime }$ are uniform in the cube ${\mathcal {C}}_{d}(\delta )$), then

$$ \mathbb{E} \theta= \frac{2}{3} d\delta^{2} , \text{var}(\theta)= \frac{28}{45} d \delta^{4} $$

(39)

1.7 A1.7. Volume of the Intersection of Two Balls of the Same Radius

Let ${{\mathscr{B}}}_{d}(Z_{j},r) $ and $ {{\mathscr{B}}}_{d}(Z_{i},r)$ be two balls in $\mathbb {R}^{d}$ with same radius and different centres Z and $Z^{\prime }$. To compute the volume of the intersection ${{\mathscr{B}}}_{d}(Z,r) \cap {{\mathscr{B}}}_{d}(Z^{\prime },r)$, we will use the formula for the volume of the d-dimensional cap (cut in the direction of $Z^{\prime }$) of height h from a d-dimensional ball ${{\mathscr{B}}}_{d}(Z,r) $:

$$ K_{d,r,h}=\frac{1}{2} r^{d} V_{d} I_{1-h^{2}/r^{2}}\left( \frac{d-1}{2},\frac{1}{2} \right) - \frac{h}{d}(r^{2}-h^{2})^{(d-1)/2} V_{d-1} , $$

(40)

where V_d is defined in (32), Γ(⋅) is the gamma-function and

$$ I_{t}(\alpha,\beta)= {{\int}_{0}^{t}} u^{\alpha-1}(1-u)^{\beta-1} du \left/ {{\int}_{0}^{1}} u^{\alpha-1}(1-u)^{\beta-1} du\right. $$

is the normalized incomplete beta-function. In the rhs of (40), the first term is the volume of the related d-dimensional hyper-sector (this expression is derived in [7]) and the second term is the volume of the cone with height h and base ${{\mathscr{B}}}_{d-1}((Z+Z^{\prime })/2,r^{\prime }) $, where $r^{\prime }= \sqrt {r^{2}-h^{2}}$.

The volume of the intersection of the balls ${{\mathscr{B}}}_{d}(Z,r)$ and $ {{\mathscr{B}}}_{d}(Z^{\prime },r)$ is therefore

$$ \text{vol}({\mathcal{B}}_{d}(Z,r) \cap {\mathcal{B}}_{d}(Z^{\prime},r))=2K_{d,r,h} $$

(41)

where $h=\frac {1}{2} \|Z-Z^{\prime }||$ and K_d,r,h is defined in (40).

1.8 A1.8. A Direct Computation of C_d,Z,r

For computing values of C_d,Z,r, we can employ the following direct approach based on the use of characteristic functions (c.f.).

(a) Compute the c.f. $ \psi _{z}(s)= {\int \limits } e^{its} \varphi _{z}(t)dt$ for z = z_j (j = 1, … , d), with the density φ_z(t) defined either by (7) or (9).
(b) As u_j are independent, the c.f. of ∥U − Z∥² is the product $\psi _{Z}(s)={\prod }_{j=1}^{d}\psi _{z_{j}}(s)$.
(c) The density of ∥U − Z∥² is found using the inversion formula
$$ p_{d,Z} (x)= \frac1{2\pi} {\int}_{-\infty}^{\infty} e^{-isx} \psi_{Z}(s) ds , x \geq 0 . $$

For computing the c.f. $ \psi _{z}(s)= {\int \limits } e^{its} \varphi _{z}(t)dt$ we can use the formula

$$ {{\int}_{a}^{b}} \frac{e^{xt}}{\sqrt{t}} dt = 2{\int}_{\sqrt{a}}^{\sqrt{b}} e^{x u^{2}} du= \sqrt{\frac{\pi}{x}} \left( \text{erfi}(\sqrt{bx})-\text{erfi}(\sqrt{ax})\right) $$

for any $0 \leq a<b<\infty $ and any complex x≠ 0. Here erfi(x) is the imaginary error function

$$ \text{erfi}(x)=\frac{2}{\sqrt{\pi}} {{\int}_{0}^{x}}e^{t^{2}} dt= \frac{2}{\sqrt{\pi}} \sum\limits_{j=0}^{\infty} \frac{x^{2j+1}}{j!(2j+1)} ; $$

the series in the right-hand side of this formula converges for all complex x.

This approach allows very accurate computation of C_d,Z,r but it is very computationally intensive and can only be performed for given Z.

Appendix 2. Important Auxiliary Results

Lemma 1

Let δ > 0, $x \in \mathbb {R} $ and η_x,δ be a r.v. η_x,δ = (ξ − x)², where r.v. ξ has uniform distribution on [−δ,δ]. Then the c.d.f. of the r.v. η_x,δ is

$$ \begin{array}{@{}rcl@{}} F_{x,\delta}(t) = \mathbb{P}\{\eta_{x,\delta} \leq t \}= \left\{ \begin{array}{ll} 0 & \text{for} t\leq 0\\ \frac{\sqrt{t}}{\delta} \cdot 1_{[ |x|\leq \delta]}& \text{for} 0<t< (\delta- |x|)^{2}\\ \frac{\delta -|x|+\sqrt{t}}{2\delta} & \text{for} (\delta- |x|)^{2} \leq t\leq (\delta+ |x|)^{2} \\ 1 & (\delta+ |x|)^{2} < t , \end{array} \right. \end{array} $$

(42)

where

$$ 1_{[ |x|\leq \delta]}= \left\{ \begin{array}{ll} 1 & \text{if} |x | \leq \delta\\ 0 & \text{if} |x | > \delta . \end{array} \right. $$

The corresponding density of η_x,δ is

$$ \varphi_{x,\delta}(t) = \left\{ \begin{array}{ll} 1/(2\delta\sqrt{t}) \cdot 1_{[ |x|\leq \delta]} & \text{for} 0<t< (\delta- |x|)^{2}\\ 1/(4\delta\sqrt{t}) & \text{for} (\delta- |x|)^{2} < t\leq (\delta+ |x|)^{2} \\ 0 & \text{otherwise.} \end{array} \right. $$

(43)

The first four central moments of the r.v. η_x,δ are:

$$ {\mu}_{x,\delta}^{(1)} = E\eta_{x,\delta} =x^{2} +\frac{\delta^{2}}{3}, {\mu}_{x,\delta}^{(2)} = \text{var}(\eta_{x,\delta}) = \frac{4\delta^{2}}{3} \left( x^{2} +\frac{\delta^{2}}{15} \right) , $$

(44)

$$ {\mu}_{x,\delta}^{(3)}=E \left[\eta_{x,\delta} - E\eta_{x,\delta}\right]^{3} = \frac{16\delta^{4}}{15} \left( x^{2} +\frac{\delta^{2}}{63} \right) , $$

(45)

$$ {\mu}_{x,\delta}^{(4)}=E \left[\eta_{x,\delta} - E\eta_{x,\delta}\right]^4 = 3 {\mu}_{x,\delta}^{(1)} {\mu}_{x,\delta}^{(3)} . $$

(46)

Proof

Clearly, if t ≤ 0 then F_x,δ(t) = 0 and so we only consider the case t > 0. In view of symmetry, for all $x\in \mathbb {R}$, δ > 0 and t ≥ 0, we have F_x,δ(t) = F_−x,δ(t) and therefore we only need to consider x ≥ 0. Also, η_x,δ ≤ (|x| + δ)² with probability 1 implying F_x,δ(t) = 1 for all t ≥ (|x| + δ)².

Assume 0 ≤ x ≤ δ. We then have for all t ≥ 0:

$$ \begin{array}{@{}rcl@{}} F_{x,\delta}(t)&=& \mathbb{P}\{(\xi-x)^{2} \leq t \}= \mathbb{P}\{(\xi-x)^{2} \leq t, \xi \leq x \}+\mathbb{P}\{(\xi-x)^{2} \leq t, \xi > x \}\\ &=& \mathbb{P}\{x-\xi \leq \sqrt{ t}, \xi \leq x \}+\mathbb{P}\{\xi-x \leq \sqrt{ t}, \xi > x \}\\ &=& \mathbb{P}\{x- \sqrt{ t} \leq \xi \leq x \}+\mathbb{P}\{x<\xi \leq x+ \sqrt{ t} \} \end{array} $$

with

$$ \mathbb{P}\{x- \sqrt{ t} \leq \xi \leq x \}= \left\{ \begin{array}{ll} \sqrt{ t}/(2 \delta) & \text{if} \sqrt{ t}<x+\delta \\ (x+\delta)/(2 \delta) & \text{if} \sqrt{ t} \geq x+\delta , \end{array} \right. $$

$$ \mathbb{P}\{x<\xi \leq x+ \sqrt{ t} \}= \left\{ \begin{array}{ll} \sqrt{ t}/(2 \delta) & \text{if} \sqrt{ t}<\delta -x \\ (\delta -x)/(2 \delta) & \text{if} \sqrt{ t} \geq \delta -x . \end{array} \right. $$

This yields the expression (42) for F_x,δ(t) in the case |x|≤ δ.

If x > δ then η_x,δ ≥ (x − δ)² with probability 1 implying F_x,δ(t) = 0 for all t ≤ (x − δ)² and $\mathbb {P}\{x<\xi \leq x+ \sqrt {t}\}=0$ for all t. Therefore

$$ F_{x,\delta}(t)= \mathbb{P}\{x- \sqrt{ t} \leq \xi \leq x \}= \left\{ \begin{array}{ll} 0 & \text{if} \sqrt{ t} \leq x-\delta \\ \frac{\delta-(x- \sqrt{ t})}{2 \delta} & \text{if} x-\delta<\sqrt{ t}<x+\delta \\ 1 & \text{if} \sqrt{ t} \geq x+\delta , \end{array} \right. $$

This yields the expression (42) for F_x,δ(t) in the case |x| > δ.

Deduction of the formulas (43) for the density and (44) for the moments from the expression (42) for the c.d.f. F_x,δ(t) is an easy exercise. □

Lemma 2

Let δ > 0, $x \in \mathbb {R} $ and $\eta _{x,\delta }^{\prime }$ be a r.v. $\eta _{x,\delta }^{\prime } = |\xi -x|$, where r.v. ξ has uniform distribution on [−δ,δ]. Then the c.d.f. of the r.v. $\eta _{x,\delta }^{\prime } $ is

$$ {F}_{x,\delta}^{\prime}(t) = \mathbb{P}\{{\eta}_{x,\delta}^{\prime} \leq t \}= \left\{\begin{array}{ll} 0 & \text{for } t\leq 0\\ \frac{{t}}{\delta} \cdot 1_{[ |x|\leq \delta]}& \text{for} 0<t< |\delta- |x| |\\ \frac{\delta -|x|+{t}}{2\delta} & \text{for} |\delta- |x| | \leq t\leq \delta+ |x| \\ 1 & \delta+ |x| < t , \end{array} \right. $$

(47)

The corresponding density of $\eta _{x,\delta }^{\prime } $ is

$$ {\varphi}_{x,\delta}^{\prime}(t) = \left\{ \begin{array}{ll} \frac1{\delta} \cdot 1_{[ |x|\leq \delta]}& \text{for} 0<t< |\delta- |x| |\\ \frac1{2\delta} & \text{for } |\delta- |x| | < t\leq \delta+ |x| \\ 0 & \text{otherwise.} \end{array} \right. $$

(48)

Lemma 2 follows from Lemma 1 by noting that ${\eta }_{x,\delta }^{\prime } = \sqrt {\eta _{x,\delta }}$.

Note that 1_[|x|≤δ] = 0 for |x| > δ and one of the two non-trivial cases in (42), (43), (47) and (48), when |x| > δ, become trivial as expressions vanish to zero.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Zhigljavsky, A., Noonan, J. Covering of High-Dimensional Cubes and Quantization. SN Oper. Res. Forum 1, 18 (2020). https://doi.org/10.1007/s43069-020-0015-8

Download citation

Received: 24 February 2020
Accepted: 14 May 2020
Published: 13 August 2020
DOI: https://doi.org/10.1007/s43069-020-0015-8

Keywords

Mathematics Subject Classification (2010)

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Covering of High-Dimensional Cubes and Quantization

Abstract

Similar content being viewed by others

Efficient Quantisation and Weak Covering of High Dimensional Cubes

Non-lattice Covering and Quantization of High Dimensional Sets

Capacitated Covering Problems in Geometric Spaces

1 Introduction

1.1 Main Notation

1.2 Main Problem of Interest

1.3 Two Contradictory Criteria and a Compromise

1.4 Covering a Cube by Smaller Cubes and Quantization

1.5 Structure of the Paper and Main Results

2 Volume of Intersection of a Cube and a Ball

2.1 The Main Quantity of Interest

2.2 A Generalization of the Quantity (3)

2.3 Normal Approximation for the Quantity (3)

2.4 Improved Normal Approximation

2.5 Simulation Study

3 Covering a Cube by n Balls

3.1 The Main Covering Scheme

3.2 Theoretical Investigation of Scheme 1

3.3 Simulation Study for Assessing Accuracy of Approximations (25) and (26)

3.4 Other Schemes

3.5 Numerical Comparison of Schemes

4 Covering a Cube by Cubes

4.1 Volume of Intersection of Two Cubes

4.2 Proportion of a Cube Covered by Smaller Cubes with Random Centres

5 Quantization

References

Funding

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of Interest

Additional information

Publisher’s Note

Appendices

Appendix 1. Several Facts about d-Dimensional Balls and Cubes

1.1 A1.1. Volume of the Ball

1.2 A1.2. Radius of the Ball of Unit Volume

1.3 A1.3. Almost All the Volume is Near the Boundary

1.4 A1.4. The Area of Volume Concentration in a Cube

1.5 A1.5. Squared Norm of a Random Point in a Cube

1.6 A1.6. Distance Between Two Random Points in a Cube

1.7 A1.7. Volume of the Intersection of Two Balls of the Same Radius

1.8 A1.8. A Direct Computation of Cd,Z,r

Appendix 2. Important Auxiliary Results

Lemma 1

Proof

Lemma 2

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Mathematics Subject Classification (2010)

Search

Navigation

1.8 A1.8. A Direct Computation of C_d,Z,r