1 Introduction

Vandermonde matrices with complex nodes appear in polynomial interpolation problems and many other fields of mathematics (see, e.g., the introduction of [2] and its references). In this paper, we are interested in rectangular Vandermonde matrices with nodes on the complex unit circle and with a large polynomial degree. These matrices generalize the classical discrete Fourier matrices to non-equispaced nodes and the involved polynomial degree is also called bandwidth. The condition number of those matrices has recently become important in the context of stability analysis of super-resolution algorithms like Prony’s method [6, 15], the matrix pencil method [12, 18], the ESPRIT algorithm [20, 21], and the MUSIC algorithm [17, 22]. If the nodes of such a Vandermonde matrix are all well-separated, with minimal separation distance greater than the inverse bandwidth, bounds on the condition number are established for example in [2, 5, 14, 18].

If nodes are nearly colliding, i.e., their distance is smaller than the inverse bandwidth, the behavior of the condition number is not yet fully understood. The seminal paper [9] coined the term (inverse) super-resolution factor for the product of the bandwidth and the separation distance of the nodes. For M nodes on a grid, the results in [7, 9] imply that the condition number grows like the super-resolution factor raised to the power of M − 1 if all nodes nearly collide. More recently, the practically relevant situation of groups of nearly colliding nodes was studied in [1, 4, 16, 19]. In different setups and oversimplifying a bit, all of these refinements are able to replace the exponent M − 1 by the smaller number m − 1, where m denotes the number of nodes that are in the largest group of nearly colliding nodes. The authors of [1, 19] focus on quite specific quantities in an optimization approach and in the so-called Prony mapping, respectively. In contrast, the condition number or the relevant smallest singular value of Vandermonde matrices with “off the grid” nodes on the unit circle is studied in [4, 16]. While [4] provided the exponent m − 1 for the first time, the proof technique leads to quite pessimistic constants and more restrictively asks all nodes (including the well-separated ones) to be within a tiny arc of the unit circle. More recently, the second version of [16] provided a quite general framework and reasonable sharp constants, but involves a technical condition which prevents the separation distance from going to zero for a fixed number of nodes and a fixed bandwidth.

Here, we present upper and lower bounds for the condition number of Vandermonde matrices with pairs of nearly colliding nodes, i.e., the special case m = 2. We achieve the expected linear order and all constants are reasonably sharp and absolute. In contrast to the more general quoted results [4, 16], the nodes can be placed on the full unit circle and the separation distance is allowed to approach zero. Our mild technical conditions, which seem to be artifacts of our proof technique, are:

  1. (i)

    A logarithmic growth in the separation distance of the well-separated nodes (which can be dropped at a price of a larger constant for the condition number estimate),

  2. (ii)

    A uniformity condition that colliding nodes behave similarly (they have the same separation distance up to a predefined constant), and

  3. (iii)

    An a priori upper bound on the separation distance of the colliding nodes.

The outline of this paper is as follows: Section 2 fixes the notation, recalls results for the case of well-separated nodes, and provides lower bounds for the condition number. In Section 3, we establish upper bounds for nodes that are well-separated from each other except for one pair of nodes that is nearly colliding. Section 4 goes one step further and studies the more general case where an arbitrary number of pairs of nodes nearly collide. Theoretical and numerical comparisons with [3, 4, 8, 16] can be found at the end of Section 4 and in Section 5.

2 Preliminaries

Let \({\mathbb {T}}:=\left \{ z\in {\mathbb {C}}\colon \left | z \right |=1 \right \}\) be the complex torus and nodes \(\left \{ {z_{1},\dots ,z_{M}} \right \} \subset {\mathbb {T}}\) be parametrized by \(z_{j}= \textnormal {e}^{-2\pi \textnormal {i}{{t_{j}}}}, j=1\dots ,M\), such that \(t_{1}< \cdots < t_{M} \in \left [ 0,1 \right )\). We fix a degree \(n \in {\mathbb {N}}\) so that N := 2n + 1 > M and set up the rectangular Vandermonde matrix:

$$ \begin{array}{@{}rcl@{}} A:= \begin{pmatrix} {z_{j}^{k}} \end{pmatrix}_{ \begin{array}{c}j{=}1,{\dots},M \\ \left| k \right|{\le} n \end{array}} = \begin{pmatrix} z_{1}^{-n}& {\cdots} & z_{1}^{-1} & 1 & {z_{1}^{1}} & {\cdots} & {z_{1}^{n}}\\ {\vdots} & & {\vdots} & & {\vdots} \\ z_{M}^{-n}& {\cdots} & z_{M}^{-1} & 1 & {z_{M}^{1}} & {\cdots} & {z_{M}^{n}} \end{pmatrix} \in {\mathbb{C}}^{M\times N}. \end{array} $$
(2.1)

The Dirichlet kernel \(D_{n} \colon {\mathbb {R}}\rightarrow {\mathbb {R}}\) is given by:

$$ D_{n}(t):= \sum\limits_{k=-n}^{n}\textnormal{e}^{2\pi\textnormal{i}{kt}} = \begin{cases} N, & t\in {\mathbb{Z}},\\ \frac{\sin(N\pi t)}{\sin(\pi t)}, & \text{otherwise}, \end{cases} $$
(2.2)

so that

$$ K := AA^{*}= \begin{pmatrix} {D_n\left( t_{i}-t_{j}\right)} \end{pmatrix}_{i,j=1}^{M} \in {\mathbb{R}}^{M\times M}. $$

The matrix K is symmetric positive definite and the spectral condition number

$$ \text{cond}(A) := \frac{\sigma_{\max}(A)}{\sigma_{\min}(A)}=\sqrt{\left\Vert K\right\Vert\left\Vert K^{-1}\right\Vert} $$

is finite since all nodes are distinct (here and throughout the paper \(\|K\|:=\sup \{\|Kx\|: \|x\|=1\}\) with \(\|x\|^{2}:={\sum }_{k}|x_{k}|^{2}\)). On the other hand, if two nodes are equal, then two rows of A are the same and by continuity the condition number diverges if two nodes collide. The (wrap around) distance of two nodes is given by:

$$ \left| t_{j}-t_{\ell} \right|_{{\mathbb{T}}}:= \underset{r\in {\mathbb{Z}}}{\min}\left| t_{j}-t_{\ell}+r \right|. $$

and we introduce the normalized separation distance of the node set as:

$$ \tau := N \underset{j\ne\ell}{\min}\left| t_{j}-t_{\ell} \right|_{{\mathbb{T}}}. $$

We call the case τ = 1 critical separation, i.e., \(\min \limits _{j\ne \ell }\left | {t_{j}-t_{\ell }} \right | _{{\mathbb {T}}} = \frac {1}{N}\), and the cases τ ≤ 1 and τ > 1 nearly colliding and well-separated, respectively. Figure 1 illustrates the situation for 4 nodes on the unit circle. The parameter ρmin describes a minimum separation distance of involved non-colliding nodes assumed in the theorems.

Fig. 1
figure 1

Sketch of four-node configurations, \(t_{1}< t_{2}< t_{3}< t_{4}\in \left [ 0,1 \right )\), t1 = 0, t3 = 1/2, N large enough, \(\tau _{1,2}:=N\left | {t_{1}-t_{2}} \right | _{{\mathbb {T}}}\), \(\tau _{3,4}:=N\left | t_{3}-t_{4} \right |_{{\mathbb {T}}}\). dotted, Theorem can be applied; filled, well-separated; lined, 3 nearly colliding nodes; empty areas, at most 2 nearly coll. nodes, but not covered by results

A reasonable result for well-separated nodes is as follows.

Theorem 2.1

[2, 18] Let A be a Vandermonde matrix as in (2.1) with τ > 1, then

$$ N\left( 1-\frac{1}{\tau}\right) \le \sigma_{\min}^{2}(A)\le N \le \sigma_{\max}^{2}(A)\le N\left( 1+\frac{1}{\tau}\right). $$

In particular, we have

$$ \textnormal{cond}(A)^{2} \le 1+ \frac{2}{\tau-1} $$

and this implies \(\left \Vert K\right \Vert \le N + N/\tau \) and \(\left \Vert K^{-1}\right \Vert =\left \Vert A^{\dagger }\right \Vert ^{2}\le (N - N/\tau )^{-1}\), where A := A(AA)− 1 denotes the Moore-Penrose pseudo inverse of A.

We note in passing that the above lower bound on the smallest singular value is an improvement of [18] by [2] and that [18] and [8] allow to replace \(\frac {1}{\tau }\) in the upper and the lower bounds by \(\frac {1}{\tau }-\frac {1}{N}\), respectively. Moreover, we have the following lower bound on the condition number. This already shows that the upper bound for well-separated nodes is quite sharp and provides the benchmark for nearly colliding nodes.

Theorem 2.2 (Lower bound)

Let A be a Vandermonde matrix as in (2.1), then:

$$ \sigma_{\min}^{2}(A) \le N-\left| D_{n}(\tau/N) \right| \le N \le N+\left| D_{n}(\tau/N) \right|\le\sigma_{\max}^{2}(A). $$

In particular, we have:

$$ \textnormal{cond}(A)^{2} \ge 1+\frac{2}{\pi\tau-1} $$

for \(\tau \in {\mathbb {N}}+\frac {1}{2}\), uniformly in N and almost matching the above upper bound.

For nearly colliding nodes, we have:

$$ \textnormal{cond}(A)^{2} \ge \frac{12}{\pi^{2}\tau^{2}}-1 \ge \frac{1}{\tau^{2}} $$

for \(\tau \le \sqrt {12/\pi ^{2}-1}\approx 0.46\) and \(\textnormal {cond}(A) \ge \sqrt {6}/\pi \tau \approx 0.77/\tau \) for all τ ≤ 1.

Proof

Without loss of generality, let t2t1 = τ/N and consider the upper left 2 × 2 block in:

$$ K= \begin{pmatrix} C & * \\ * & * \end{pmatrix}, \quad C:=\begin{pmatrix} {D_n\left( 0\right)} & {D_n\left( \tau/N\right)} \\ {D_n\left( \tau/N\right)} & {D_n\left( 0\right)} \end{pmatrix}. $$

We apply Lemma A.5, and get:

$$ \text{cond}(A)^{2}=\frac{\lambda_{\max}(K)}{\lambda_{\min}(K)}\ge \frac{\lambda_{\max}(C)}{\lambda_{\min}(C)} = \frac{D_n(0)+\left| D_n\left( \tau/N\right)\right|}{D_n(0)-\left| D_n\left( \tau/N\right)\right|} = 1+\frac{2\left| {D_n\left( \tau/N\right)} \right|}{N-\left| {D_n\left( \tau/N\right)} \right|}, $$

and Lemma A.1 yields the assertion. □

3 Nodes with one nearly colliding pair

Definition 3.1

Let M ≥ 2 and \(0=t_{1}<\cdots <t_{M} \in \left [ 0,1 \right )\) such that:

$$ \begin{array}{@{}rcl@{}} \left| t_{1}-t_{2} \right|_{{\mathbb{T}}} &=& \frac{\tau}{N},\qquad \qquad\qquad\qquad0<\tau\le 1,\\ \left| t_{j}-t_{\ell} \right|_{{\mathbb{T}}} &\ge& \frac{\rho}{N}, j\ne \ell, \ell\ge 3, \qquad~1<\rho<\infty, \end{array} $$

then {t1,…,tM} is called a set of nodes with one nearly colliding pair; see Fig. 2 for an illustration. Due to periodicity, the choice t1 = 0 and \(\left | t_{1}-t_{2} \right |_{{\mathbb {T}}} = \frac {\tau }{N}\) is without loss of generality.

Fig. 2
figure 2

Example of a node set with M = 5 satisfying Def. 3.1

Now, we estimate an upper bound on the condition number of the Hermitian matrix K by bounding \(\left \Vert {K}\right \Vert \) directly and applying Lemma A.4 to K− 1 before bounding \(\left \Vert K^{-1}\right \Vert \). For that, we introduce some notation for abbreviation.

Definition 3.2

We define \(a_{1} := \begin {pmatrix} {z_{1}^{k}} \end {pmatrix}_{\left | k \right |\le n}\in {\mathbb {C}}^{1\times N}\) and \(A_{2}:= \begin {pmatrix} {z_{j}^{k}} \end {pmatrix}_{ \begin {array}{c}j{=}2,{\dots },M\\ \left | k \right |{\le } n \end {array}}\in {\mathbb {C}}^{(M-1)\times N}\) so that with:

$$ \begin{array}{@{}rcl@{}} a_{1}a_{1}^{*}=N, \quad K_{2}:= A_{2}A_{2}^{*} \quad\text{and}\quad b:=A_{2} a_{1}^{*}=\begin{pmatrix} D_{n}(\tau/N)\\D_{n}(t_{3})\\ \vdots\\ D_{n}(t_{M}) \end{pmatrix}, \end{array} $$
(3.1)

we have the partitioning:

$$ \begin{array}{@{}rcl@{}} A=\begin{pmatrix} a_{1}\\ A_{2} \end{pmatrix} \quad \text{and} \quad K =\begin{pmatrix} N & b^{*} \\ b & K_{2} \end{pmatrix}, \end{array} $$
(3.2)

where A2 is a Vandermonde matrix with nodes that are at least \(\frac {\rho }{N}\) separated.

Lemma 3.3

Under the conditions of Definition 3.1 and for ρ ≥ 6, we have:

$$ \left\Vert K\right\Vert\le 2.3 N. $$

Proof

The key idea is to see the set of nodes as a union of two well-separated subsets and use the existing bounds for these. In contrast to the next chapter, here, one of the sets only consists of a single node. We start by noting that Theorem 2.1 and (3.1) yield \(\left \Vert {b}\right \Vert ^{2}\le \left \Vert {a_{1}}\right \Vert ^{2} \left \Vert {A_{2}}\right \Vert ^{2}=N\left \Vert {K_{2}}\right \Vert \). Together with the decomposition (3.2), the triangle inequality, Lemma A.6, and Theorem 2.1, we obtain:

$$ \left\Vert K\right\Vert \le \left\Vert \begin{pmatrix} N&0\\0&K_{2} \end{pmatrix}\right\Vert +\left\Vert {\begin{pmatrix} 0&b^{*}\\b&0 \end{pmatrix}}\right\Vert \le\left\Vert {K_{2}}\right\Vert +\left\Vert {b}\right\Vert \le N\left( \frac{\rho+1}{\rho}+\sqrt{\frac{\rho+1}{\rho}}\right). $$

Lemma 3.4

Under the conditions of Definition 3.1 and with b as in (3.1), we have:

$$ b=K_{2} e_{1} +r, $$

where \(e_{1}\in {\mathbb {R}}^{(M-1)}\) denotes the first unit vector and:

$$ \left\Vert r\right\Vert^{2} \le \left( N- {D_n\left( \tau/N\right)} \right)^{2} +N^{2}\tau^{2} \left( \frac{\pi^{4}}{12\rho^{2}}+\frac{1.21\pi}{\rho^{3}}+ \frac{\pi^{4}}{180\rho^{4}} \right). $$

Proof

The vector b can be approximated by the first column of K2 in the sense that:

$$ b=\begin{pmatrix} {D_n\left( \tau/N\right)} \\ {D_n\left( t_{3}\right)} \\ \vdots\\ {D_n\left( t_{M}\right)} \end{pmatrix}=\begin{pmatrix} {D_n\left( 0\right)} \\ {D_n\left( t_{3}-\tau/N\right)} \\ \vdots\\ {D_n\left( t_{M}-\tau/N\right)} \end{pmatrix}+ \begin{pmatrix} r_{1} \\ \vdots\\ r_{M-1} \end{pmatrix}. $$

We have \(\left | r_{1} \right | = N- {D_n\left (\tau /N\right )} \) and for j = 2,…,M − 1 the mean value theorem yields:

$$ \left| r_{j} \right| = \left| {D_n\left( t_{j+1}\right)} - {D_n\left( t_{j+1} - \tau/N\right)} \right| = \left| {D_n^{\prime}\left( \xi_{j}\right)} \right|\frac{\tau}{N},\! \!\quad \xi_{j}\!\in\! \left( \left| {t_{j+1} - \frac{\tau}{N}} \right| _{{\mathbb{T}}},\left| {t_{j+1}} \right| _{{\mathbb{T}}}\right). $$

Note that, in the worst case, half of the nodes can be as close as possible (under the assumed separation condition) to t2 not only on its right but also on its left. Hence, for \(j=2,\dots ,\left \lceil \frac {M}{2}\right \rceil \), \(\xi _{j} \ge \frac {(j-1)\rho }{N}\) and Lemma A.1 lead to:

$$ \left| r_{j} \right|\le N\left( \frac{\pi}{2N\left| {\xi_{j}} \right| }+\frac{1}{2N^{2}\left| {\xi_{j}} \right| ^{2}}\right)\tau \le N\left( \frac{\pi}{2(j-1)\rho}+\frac{1}{2(j-1)^{2}\rho^{2}}\right)\tau. $$

Thus, for all nodes, we get:

$$ \sum\limits_{j=2}^{M-1}|r_{j}|^{2} \le 2 \sum\limits_{j=2}^{\lceil M/2\rceil}|r_{j}|^{2} \le N^{2}\tau^{2} \left( \frac{\pi^{2}}{2\rho^{2}} \underbrace{\sum\limits_{j=1}^{\infty}\frac{1}{j^{2}}}_{=\frac{\pi^{2}}{6}} + \frac{\pi}{\rho^{3}} \underbrace{\sum\limits_{j=1}^{\infty} \frac{1}{j^{3}}}_{\le 1.21} + \frac{1}{2\rho^{4}}\underbrace{\sum\limits_{j=1}^{\infty} \frac{1}{j^{4}}}_{=\frac{\pi^{4}}{90}} \right). $$

Lemma 3.5

Under the conditions of Definition 3.1 and for ρ ≥ 5, we have:

$$ \left\Vert K^{-1}\right\Vert\le \frac{C(\rho)}{N\tau^{2}}, $$

where

$$ C(\rho) = \left( \frac{2\rho-1}{\rho-1}+\sqrt{\frac{\rho}{\rho-1}} \right) \left[2- \frac{\rho}{\rho-1} \left( 1+\frac{\pi^{4}}{12\rho^{2}}+\frac{1.21\pi}{\rho^{3}}+\frac{\pi^{4}}{180\rho^{4}} \right)\right]^{-1}. $$

Proof

We consider K decomposed as in (3.2) and apply Lemma A.4 with respect to K2 to obtain:

$$ K^{-1} = \begin{pmatrix} I&0\\-K_{2}^{-1}b&I \end{pmatrix} \begin{pmatrix} (N-b^{*}K_{2}^{-1}b)^{-1}& 0\\0&K_{2}^{-1} \end{pmatrix} \begin{pmatrix} I & -b^{*}K_{2}^{-1} \\ 0 & I \end{pmatrix} $$

and thus,

$$ \left\Vert K^{-1}\right\Vert \le \left\Vert \begin{pmatrix} I&0\\-K_{2}^{-1}b&I \end{pmatrix}\right\Vert^{2} \max\left\{\left\Vert {K_{2}^{-1}}\right\Vert,\left\Vert {\left( N-b^{*}K_{2}^{-1}b\right)^{-1}}\right\Vert \right\}. $$

First of all, we establish an upper bound for the norm of the triangular matrix. Equation (3.1) and Theorem 2.1 imply:

$$ \left\Vert K_{2}^{-1}b\right\Vert=\left\Vert (A_{2} A_{2}^{*})^{-1} A_{2} a_{1}^{*}\right\Vert\le \left\Vert A_{2}^{\dagger}\right\Vert\left\Vert a_{1}\right\Vert\le\sqrt{\frac{\rho}{\rho-1}}. $$

Together with Lemma A.6, we obtain:

$$ \begin{array}{@{}rcl@{}} \left\Vert \begin{pmatrix} I&0\\-K_{2}^{-1}b&I \end{pmatrix}\right\Vert^{2} \le 1+\left\Vert K_{2}^{-1}b\right\Vert+\left\Vert K_{2}^{-1}b\right\Vert^{2} \le \frac{2\rho-1}{\rho-1}+\sqrt{\frac{\rho}{\rho-1}}. \end{array} $$
(3.3)

The next step is to bound \((N-b^{*}K_{2}^{-1}b)^{-1}\). Lemma 3.4 yields:

$$ b^{*}K_{2}^{-1}b = (K_{2} e_{1}+ r)^{*} K_{2}^{-1} (K_{2} e_{1}+ r) = 2 D_{n}(\tau/N)-D_{n}(0)+r^{*}K_{2}^{-1}r. $$

Applying the second part of Lemma 3.4, Lemma A.1, and Theorem 2.1 yields:

$$ \begin{array}{@{}rcl@{}} N-b^{*}K_{2}^{-1}b &\ge& 2\left( N-D_{n}(\tau/N)\right) - \left\Vert r\right\Vert^{2}\left\Vert K_{2}^{-1}\right\Vert\\ &\ge& \!\left( N - D_{n}(\tau/N)\right) \!\left( 2 - \left( N-D_{n}(\tau/N)\right)\left\Vert {K_{2}^{-1}}\right\Vert \right)- \!\left\Vert {K_{2}^{-1}}\right\Vert \sum\limits_{j=2}^{M-1}\left| {r_{j}} \right| ^{2} \\ &\ge& N\tau^{2} \left( 2-N\left\Vert K_{2}^{-1}\right\Vert\right) -\|K_{2}^{-1}\| N^{2}\tau^{2} \left( \frac{\pi^{4}}{12\rho^{2}}+\frac{1.21\pi}{\rho^{3}}+\frac{\pi^{4}}{180\rho^{4}}\right) \\ &\ge& N\tau^{2} \left[2- \frac{\rho}{\rho-1} \left( 1+\frac{\pi^{4}}{12\rho^{2}}+\frac{1.21\pi}{\rho^{3}}+\frac{\pi^{4}}{180\rho^{4}} \right)\right]. \end{array} $$

For ρ ≥ 5, the most inner bracketed term takes values in (1,1.4) such that the square bracketed term is positive. Forming the reciprocal gives the result, since Theorem 2.1 also implies:

$$ \begin{array}{@{}rcl@{}} N\left\Vert K_{2}^{-1}\right\Vert\le \frac{\rho}{\rho-1}\le \frac{\rho-1}{\rho-2}\le \left[2- \frac{\rho}{\rho-1} \left( 1+\hdots\right)\right]^{-1}. \end{array} $$
(3.4)

Theorem 3.6 (Upper bound)

Under the conditions of Definition 3.1 with \(\rho \ge \rho _{\min \limits }=6\), we have:

$$ \textnormal{cond}(A)\le \frac{4}{\tau}. $$

Proof

The bound follows from Lemmata 3.3 and 3.5 with C(ρ) ≤ C(6) ≤ 6.5. □

Lower and upper bounds in Theorems 2.2 and 3.6 yield:

$$ \frac{1}{\tau}\le\text{cond}(A)\le \frac{4}{\tau} $$

for τ ≤ 0.46 and 6 ≤ ρ. The condition on ρ implies that for specific configurations of M nodes, our result becomes effective as early as N ≈ 6M—this is in contrast to the results [4, 16], where N has to be much larger.

Remark 3.7 (Constants)

Some comments regarding what is lost during our proof:

  1. (i)

    The constant in Lemma 3.3 is a numerical value for all ρ ≥ 6, indeed the proof is valid for all values ρ > 1. The case M = 2 shows that Lemmata 3.3 and 3.4 are reasonably sharp since in this case \(\left \Vert {K}\right \Vert =N+D_{n}(\tau /N)\ge N(2-\pi ^{2}\tau ^{2}/6)\) and \(\left \Vert r\right \Vert =N-D_{n}(\tau /N)\ge N(2-\tau ^{2})\); see Lemma A.1 for the two inequalities.

  2. (ii)

    In Lemma 3.5, the constant C(ρ) is monotone decreasing in ρ; see also Fig. 3. It is bounded below by 3 which is due to the relatively crude norm estimate on the block triangular factors in the Schur complement decomposition. Note that the left-hand side in (3.3) is bounded from below by \(1+\left \Vert K_{2}^{-1}b\right \Vert ^{2}\). An additional minor improvement on C(ρ) and on the range of admissible values for ρ can be achieved when applying Lemma A.1 to two factors simultaneously.

Fig. 3
figure 3

C(ρ) in Lemma 3.5

Remark 3.8 (Generalizations and limitations)

In principle, the suggested Schur complement technique can be generalized to more than two nodes colliding and also to the multivariate case:

  1. (i)

    Let M ≥ 3 and \(0=t_{1}<\cdots <t_{M} \in \left [ 0,1 \right )\) be such that {t1,t2,t3} nearly collide and decompose:

    $$ \begin{array}{@{}rcl@{}} K &=&\begin{pmatrix} K_{1} & B^{*} \\ B & K_{2} \end{pmatrix},\quad K_{1}=\begin{pmatrix} N & {D_n\left( t_{1}-t_{2}\right)} \\ {D_n\left( t_{1}-t_{2}\right)} & N \end{pmatrix},\\ K_{2}&=&\begin{pmatrix} {D_n\left( t_{i}-t_{j}\right)} \end{pmatrix}_{i,j=3}^{M}. \end{array} $$

    While it is clear that the Schur complement \(K_{1}-B^{*} K_{2}^{-1} B\) is strictly positive definite, establishing a lower bound on its smallest singular value similar to the proofs of Lemmata 3.4 and 3.5 seems considerably harder. Already, the linear approximation in Lemma 3.4 then needs to be replaced by a higher order approximation for the matrix B.

  2. (ii)

    Consider the bivariate case and the Vandermonde matrix:

    $$ A= \begin{pmatrix} z_{j}^{\gamma} \end{pmatrix}_{ \begin{array}{c}j{=}1,{\hdots},M \\ \left\Vert \gamma\right\Vert_{\infty} {\le} n \end{array}} \in {\mathbb{C}}^{M\times N^{2}}, $$

    where \(z_{j}=(x_{j}, y_{j})=(\textnormal {e}^{-2\pi \textnormal {i}{u_{j}}},\textnormal {e}^{-2\pi \textnormal {i}{v_{j}}}) \in {\mathbb {T}}^{2}\), \(\gamma =(\alpha ,\beta )\in \mathbb {Z}^{2}\) is a multi-index, and \(z_{j}^{\gamma } := x_{j}^{\alpha } \cdot y_{j}^{\beta }\). The distance of the nodes tj = (uj,vj) ∈ [0,1)2 is measured by \(\left | {t_{j}-t_{\ell }} \right | _{{\mathbb {T}}}:= \min \limits _{r\in \mathbb {Z}^{2}}\left \Vert {t_{j}-t_{\ell }+r}\right \Vert _{\infty }\) and we consider the situation as in Definitions 3.1 and 3.2 with K = AA. Lemma 3.4 can be proven using the bivariate mean value theorem to get \(|r_{j}|\le N\tau \pi / \left | \xi _{j} \right |_{{\mathbb {T}}}\), j = 2,3,…,M, and the packing argument [14, Lem. 4.5] to get:

    $$ \left\Vert r\right\Vert^{2} \le (N^{2}- D_{n}(u_{2})D_{n}(v_{2}))^{2} + \frac{12\pi^{2} N^{4}\tau^{2}}{(\rho-1)^{2}} \left( 1+\log\left\lceil{\sqrt{M/6}}\right\rceil\right). $$

    We need additional assumptions for Lemma 3.5 to work since results for general well-separated nodes, cf. [15], seem to be too weak. If the nodes t2,…,tM are a subset of equispaced nodes in \({\mathbb {T}}^{2}\), then [14, Cor. 4.11] yields \(\left \Vert {K_{2}^{-1}}\right \Vert \le (N-N/\rho )^{-2}\). Together with M ≥ 4 and \(\rho \ge 4+2\log M\), this yields \(\left \Vert K^{-1}\right \Vert \le 20 / N^{2}\tau ^{2}\).

4 Pairs of nearly colliding nodes

We now study the situation in which the Vandermonde matrix comes from pairs of nearly colliding nodes.

Definition 4.1

Let \(n\in {\mathbb {N}}\), N = 2n + 1, c ≥ 1 and let \(t_{1}< {\cdots } < t_{\frac {M}{2}}\in \left [ 0,1 \right )\) and \(t_{\frac {M}{2}+1}<\cdots <t_{M} \in \left [ 0,1 \right )\) for M ≥ 4 even such that:

$$ \begin{array}{@{}rcl@{}} \frac{\tau}{N} &\le& \left| t_{j}-t_{j+\frac{M}{2}} \right|_{{\mathbb{T}}} \le \frac{c\tau}{N},\quad j=1,\dots,\frac{M}{2}, \qquad 0<c\tau\le 1,\\ \frac{\rho}{N} &\le&\left| t_{j}-t_{\ell} \right|_{{\mathbb{T}}}, \quad j < \ell, \ell \ne j+\frac{M}{2}, \qquad\qquad1<\rho < \infty, \end{array} $$

then {t1,…,tM} is called a set of nodes with pairs of nearly colliding nodes (see Fig. 4 for an illustration). The constant c measures the uniformity of the colliding nodes. For subsequent use, we additionally introduce the following wrap around distance of indices \(\left | {j-\ell } \right |^{\prime }:=\min \limits _{r\in \mathbb {Z}} \left | j-\ell +r\frac {M}{2} \right |\) with respect to \(\frac {M}{2}\).

Fig. 4
figure 4

Example of a node set with M = 8 satisfying Definition 4.1

Definition 4.2

We define:

$$ A_{1}\!:=\! \begin{pmatrix} {z_{j}^{k}} \end{pmatrix}_{ \begin{array}{c}j{=}1,{\dots},M/2\\ \left| k \right|{\le }n \end{array}}\in {\mathbb{C}}^{(M/2)\times N} \quad \!\text{and}\! \quad A_{2}\!:=\! \begin{pmatrix} {{z_{j}^{k}}} \end{pmatrix} _{\begin{array}{c}\!j{=}M/2{+}1,{\dots},M\!\\ \left| {k} \right| {\le} n \end{array}}\!\in {\mathbb{\!C}}^{(M/2)\times N} $$

so that with \(K_{1}:=A_{1}A_{1}^{*}\), \(K_{2}:=A_{2}A_{2}^{*}\), and \(B:=A_{2}A_{1}^{*}\) we have the partitioning:

$$ \begin{array}{@{}rcl@{}} A=\begin{pmatrix} A_{1}\\A_{2} \end{pmatrix}, \quad K=\begin{pmatrix} K_{1} & B^{*} \\ B & K_{2} \end{pmatrix}. \end{array} $$
(4.1)

Note that under the assumptions in Definition 4.1 the Vandermonde matrices A1 and A2 are each corresponding to nodes that are at least ρ/N-separated.

The proof technique we use is analogous to the one we used in the case of two nearly colliding nodes. The difference is that we have a matrix K1 instead of a scalar and the block B is a matrix instead of a vector. Subsequently, Lemma 4.3 establishes an upper bound on \(\left \Vert {K}\right \Vert \) and Lemmata 4.4, 4.5, and 4.6 establish an upper bound on \(\left \Vert K^{-1}\right \Vert \).

Lemma 4.3

Under the conditions of Definition 4.1, we have:

$$ \left\Vert K\right\Vert\le 2N \cdot\frac{\rho+1}{\rho}. $$

Proof

Similar to Lemma 3.3, we start by noting that \(\left \Vert B\right \Vert ^{2}\le \left \Vert K_{1}\right \Vert \left \Vert K_{2}\right \Vert \). Together with the decomposition (4.1), the triangle inequality, Lemma A.6, and Theorem 2.1, this leads to:

$$ \begin{array}{@{}rcl@{}} \left\Vert K\right\Vert &\le& \left\Vert \begin{pmatrix} K_{1}&0\\0&K_{2} \end{pmatrix}\right\Vert +\left\Vert \begin{pmatrix} 0&B^{*}\\{{B}}&0 \end{pmatrix}\right\Vert\\ &\le& \max\left\{ \left\Vert K_{1}\right\Vert,\left\Vert K_{2}\right\Vert \right\} + \sqrt{\left\Vert K_{1}\right\Vert\left\Vert K_{2}\right\Vert} \le 2N\cdot\frac{\rho+1}{\rho}. \end{array} $$

Lemma 4.4

Under the conditions of Definition 4.1, R1 := BK1 fulfills:

$$ \left\Vert R_{1}\right\Vert \le N- {D_n\left( c\tau/N\right)} + Nc\tau\left( \frac{\pi(\log\left\lfloor\frac{M}{4}\right\rfloor+1)}{\rho}+\frac{\pi^{2}}{6\rho^{2}}\right). $$

Proof

The Dirichlet kernel Dn is monotone decreasing on \(\left [ 0,1/N \right ]\). Hence, for the diagonal entries, we obtain:

$$ \left| (R_{1})_{jj} \right| = \left| D_n\left( t_{j}-t_{j+\frac{M}{2}}\right) -N \right| = N -{D_n}\left( t_{j}-t_{j+\frac{M}{2}}\right) \le N -{D_n}({c\tau/N}). $$

The off-diagonal entries are bounded by the mean value theorem and Lemma A.1 as:

$$ \begin{array}{@{}rcl@{}} \left| (R_{1})_{j\ell} \right| &=&\left| {D_n\left( t_{j}-t_{\ell}\right)} - {D_n\left( t_{j+\frac{M}{2}}-t_{\ell}\right)} \right|\\ &\le& \left| {D_n^{\prime}\left( \xi_{j\ell}\right)} \right| \frac{c\tau}{N} \le Nc\tau\left( \frac{\pi}{2 N\xi_{j\ell}} + \frac{1}{2N^{2}\xi_{j\ell}^{2}} \right), \end{array} $$

where \(\left ( \left | t_{j+\frac {M}{2}}-t_{\ell } \right |_{{\mathbb {T}}},\left | t_{j}-t_{\ell } \right |_{{\mathbb {T}}} \right )\ni \xi _{j\ell } \ge \left | {j-\ell } \right |^{\prime }\rho /N\) implies:

$$ \left| (R_{1})_{j\ell} \right| \le Nc\tau \left( \frac{\pi}{2\rho \left| j-\ell \right|^{\prime}} + \frac{1}{2\rho^{2}(\left| {j-\ell} \right| ^{\prime})^{2}} \right)=:(\widetilde R_{1})_{j\ell} $$

for \(j,\ell =1,\dots ,\frac {M}{2}\), j. Additionally, we set \((\widetilde R_{1})_{j j}:=N -D_{n}({c\tau /N})\). We bound the spectral norm of R1 by the one of the real symmetric matrix \(\widetilde R_{1}\) using Lemma A.2 and proceed by:

$$ \left\Vert R_{1}\right\Vert\le\left\Vert \widetilde R_{1}\right\Vert\le\left\Vert \widetilde R_{1}\right\Vert_{\infty} \le N-D_{n}({c\tau/N}) + 2Nc\tau \sum\limits_{j=1}^{\lfloor\frac{M}{4}\rfloor} \left( \frac{\pi}{2j\rho}+\frac{1}{2j^{2}\rho^{2}}\right), $$

from which the assertion follows. □

Lemma 4.5

Under the conditions of Definition 4.1, R1 = BK1 and R2 := BK2 fulfill:

$$ \left\Vert 2NI+R_{1}^{*}+R_{2}\right\Vert \le 2 {D_n\left( \tau/N\right)} + c^{2}\tau^{2} N\left( \frac{\pi^{2}(\log\lfloor\frac{M}{4}\rfloor+1)}{\rho} + \frac{\pi^{3}}{3\rho^{2}} + \frac{2.42}{\rho^{3}}\right). $$

Proof

First, note that:

$$ (R_{1}^{*}+R_{2})_{j\ell} = {D_n\left( t_{j+\frac{M}{2}}-t_{\ell}\right)} + {D_n\left( t_{j} - t_{\ell+\frac{M}{2}}\right)} - {D_n\left( t_{j+\frac{M}{2}}-t_{\ell+\frac{M}{2}}\right)} - {D_n\left( t_{j}\!-t_{\ell}\right)} . $$

Monotonicity of the Dirichlet kernel Dn on \(t\in \left [ 0,1/N \right ]\) gives:

$$ \left| (2NI+R_{1}^{*}+R_{2})_{jj} \right|=2\left| {D_n\left( t_{j+\frac{M}{2}}-t_{j}\right)} \right| \le 2{D_n}({\tau/N}) $$

for j = . For each fixed off-diagonal entry j, the matrix 2NI has no contribution. We write the node tj+M/2 as a perturbation of tj by hj := tj+M/2tj and expand the Dirichlet kernel by its Taylor polynomial of degree 2 in the point \(\hat h :=t_{j} - t_{\ell } + \frac {h_{j} - h_{\ell }}{2}\). Using:

$$ D_{n}(h)=D_{n}(\hat h)+D_{n}^{\prime}(\hat h)(h-\hat h)+\frac{D_{n}^{\prime\prime}(\xi)}{2}(h-\hat h)^{2} $$

for some \(\xi \in [\hat h,h]\cup [h,\hat h]\), the constant term, as well as the linear term, cancels out and we get:

$$ \begin{array}{@{}rcl@{}} &&D_{n}(t_{j}+h_{j}-t_{\ell}) + D_{n}(t_{j}-t_{\ell}-h_{\ell}) - D_{n}(t_{j}+h_{j} - t_{\ell}-h_{\ell}) - D_{n}(t_{j}-t_{\ell}) \\ &=&\frac{1}{8} \left( D_{n}^{\prime\prime}(\xi_{1}) (h_{j}+h_{\ell})^{2} + D_{n}^{\prime\prime}(\xi_{2}) (h_{j}+h_{\ell})^{2} + D_{n}^{\prime\prime}(\xi_{3}) (h_{j}-h_{\ell})^{2} + D_{n}^{\prime\prime}(\xi_{4}) (h_{j}-h_{\ell})^{2} \right). \end{array} $$

Lemma A.1 and \(\xi _{1},\dots ,\xi _{4} \ge \left | j-\ell \right |^{\prime }\rho /N\) imply:

$$ \begin{array}{@{}rcl@{}} \left| (R_{1}^{*}+R_{2})_{j\ell} \right| &\le &\frac{N^{3}}{4} \left( \frac{\pi^{2}}{2\left| {j-\ell} \right| ^{\prime}\rho}+\frac{\pi}{(\left| {j-\ell} \right| ^{\prime})^{2}\rho^{2}} + \frac{1}{ (\left|{j-\ell} \right| ^{\prime})^{3}\rho^{3}} \right)\\ &&\cdot \left( (h_{j}+h_{\ell})^{2}+(h_{j}-h_{\ell})^{2}\right) \end{array} $$

and hence by hj,hcτ/N

$$ \left| (2NI+R_{1}^{*}+R_{2})_{j\ell} \right| \le N c^{2}\tau^{2}\left( \frac{\pi^{2}}{2\left| {j-\ell} \right| ^{\prime}\rho}+\frac{\pi}{(\left| {j-\ell} \right| ^{\prime})^{2}\rho^{2}} + \frac{1}{ (\left| j-\ell \right|^{\prime})^{3}\rho^{3}} \right). $$

The matrix \(2NI+R_{1}^{*}+R_{2}\) is real symmetric so that:

$$ \begin{array}{@{}rcl@{}} \left\Vert 2NI+R_{1}^{*}+R_{2}\right\Vert &\le& \left\Vert 2NI+R_{1}^{*}+R_{2}\right\Vert_{\infty} \\ &\le& 2 {D_n\left( \tau/N\right)} + 2\sum\limits_{j=1}^{\lfloor\frac{M}{4}\rfloor} N c^{2}\tau^{2}\left( \frac{\pi^{2}}{2 j\rho}+\frac{\pi}{j^{2}\rho^{2}} + \frac{1}{j^{3}\rho^{3}} \right)\\ &\le& 2 {D_n\left( \tau/N\right)} + 2c^{2}\tau^{2} N\left( \frac{\pi^{2}(\log\lfloor\frac{M}{4}\rfloor+1)}{2\rho} + \frac{\pi^{3}}{6\rho^{2}} + \frac{1.21}{\rho^{3}}\right) \end{array} $$

and therefore the result holds. □

Lemma 4.6

Under the conditions of Definition 4.1 with τ ≤ 1/2 and ρ ≥ 2, such that:

$$ \begin{array}{@{}rcl@{}} \tilde C(\tau,\rho,c,M) &:=& 2 - \frac{c^{2}\pi^{2}(\log\lfloor\frac{M}{4}\rfloor+1)}{\rho}-\frac{c^{2}\pi^{3}}{3\rho^{2}}-\frac{2.42c^{2} }{\rho^{3}} \\ &&-\frac{\rho}{(\rho-1)} \left( \frac{c^{2}\pi^{2}}{6}\tau + \frac{c\pi(\log\lfloor\frac{M}{4}\rfloor+1)}{\rho}+\frac{c\pi^{2}}{6\rho^{2}} \right)^{2} \end{array} $$

is positive, we have:

$$ \left\Vert K^{-1}\right\Vert\le \frac{C(\tau,\rho,c,M)}{N\tau^{2}}, $$

where

$$ C(\tau,\rho,c,M):= \left( \frac{2\rho}{\rho-1}+\sqrt{\frac{\rho+1}{\rho-1}} \right)/\tilde C(\tau,\rho,c,M). $$

Figure 5 visualizes the values of the constant \(\tilde {C}(\tau ,\rho ,c,M)\) with respect to ρ and τ. Please note that (i) increasing the constant c by a factor \(\sqrt {2}\) has to be compensated approximately by halving τ and doubling ρ and (ii) increasing the number of nodes M from 4 to 64 has to be compensated approximately by tripling ρ.

Fig. 5
figure 5

Values of \(\tilde {C}(\tau ,\rho ,c,M)\) in Lemma 4.6 depending on τ and ρ for different M and c. Negative values are set to zero

Proof

We proceed analogously to Lemma 3.5 and apply Lemma A.4 to the matrix K decomposed as in (4.1) and obtain:

$$ \begin{array}{@{}rcl@{}} \|K^{-1}\| \le \max\{\|K_{1}^{-1}\|,\|(K_{2} - BK_{1}^{-1}B^{*})^{-1}\|\} \left\Vert \begin{pmatrix} I & 0 \\ -BK_{1}^{-1} & I \end{pmatrix}\right\Vert^{2}. \end{array} $$
(4.2)

Definition 4.2 and Theorem 2.1 yield:

$$ \left\Vert BK_{1}^{-1}\right\Vert\le \left\Vert A_{2}\right\Vert\left\Vert A_{1}^{\dagger}\right\Vert\le\sqrt{\frac{\rho+1}{\rho-1}}, $$

together with Lemma A.6, we obtain:

$$ \left\Vert \begin{pmatrix} I & 0 \\ -BK_{1}^{-1} & I \end{pmatrix}\right\Vert^{2} \le 1+\left\Vert {BK_{1}^{-1}}\right\Vert +\left\Vert {BK_{1}^{-1}}\right\Vert ^{2} \le \frac{2\rho}{\rho-1}+\sqrt{\frac{\rho+1}{\rho-1}}. $$

Now, we estimate \(\left \Vert (K_{2} - BK_{1}^{-1}B^{*})^{-1}\right \Vert \), which is done by the following steps:

  1. (i)

    First, note that \(I-A_{1}^{\dagger } A_{1}\) is an orthogonal projector and thus Theorem 2.1 implies:

    $$ \left\Vert K_{2}-BK_{1}^{-1}B^{*}\right\Vert \le \left\Vert A_{2}\right\Vert\left\Vert I-A_{1}^{\dagger} A_{1}\right\Vert\left\Vert A_{2}^{*}\right\Vert \le \left\Vert A_{2}\right\Vert^{2}<2N. $$

    We apply Lemma A.3 with η = 2N, use the identities R1 = BK1 and R2 = BK2, apply the triangular inequality, and the sub-multiplicativity of the matrix norm to get:

    $$ \begin{array}{@{}rcl@{}} \left\Vert (K_{2} - BK_{1}^{-1}B^{*})^{-1}\right\Vert \!&=&\! \frac{1}{2N - \left\Vert 2NI - K_{2}+BK_{1}^{-1}B^{*}\right\Vert}\\ \!&\le&\! \frac{1}{2N - \left\Vert 2NI + R_{1}^{*} + R_{2}\right\Vert - \left\Vert R_{1}\right\Vert^{2}\left\Vert K_{1}^{-1}\right\Vert}. \end{array} $$
    (4.3)
  2. (ii)

    Lemma 4.5 leads to:

    $$ \begin{array}{@{}rcl@{}} 2N-\left\Vert 2NI + R_{1}^{*}+R_{2}\right\Vert &\ge& 2(N- {D_n\left( \tau/N\right)} )\\ &&- c^{2}\tau^{2} N\left( \frac{\pi^{2}(\log\lfloor\frac{M}{4}\rfloor+1)}{\rho} + \frac{\pi^{3}}{3\rho^{2}} + \frac{2.42}{\rho^{3}}\right). \end{array} $$
  3. (iii)

    We apply Theorem 2.1 and Lemma 4.4 to get:

    $$ \begin{array}{@{}rcl@{}} \left\Vert R_{1}\right\Vert^{2} \left\Vert K_{1}^{-1}\right\Vert \le \frac{\rho}{N(\rho-1)} \left[ N- {D_n\left( c\tau/N\right)} + Nc\tau\left( \frac{\pi(\log\left\lfloor\frac{M}{4}\right\rfloor+1)}{\rho}+\frac{\pi^{2}}{6\rho^{2}}\right) \right]^{2}. \end{array} $$
  4. (iv)

    We use the estimates for the Dirichlet kernel \(N- {D_n\left (\tau /N\right )} \ge N\tau ^{2}\) in ii) and \(N-D_n({c\tau /N}) \le N\frac {\pi ^{2}}{6}c^{2}\tau ^{2}\) in iii) (see Lemma A.1), and insert this in (4.3) to get finally:

    $$ \begin{array}{@{}rcl@{}} \left\Vert (K_{2} - BK_{1}^{-1}B^{*})^{-1}\right\Vert &\le& \frac{1}{N\tau^{2}} \left[ 2 - \frac{c^{2}\pi^{2}(\log\lfloor\frac{M}{4}\rfloor+1)}{\rho}-\frac{c^{2}\pi^{3}}{3\rho^{2}}-\frac{2.42c^{2} }{\rho^{3}} \right.\\ &&-\!\left.\frac{\rho}{(\rho-1)} \left( \frac{\!c^{2}\pi^{2}}{6}\tau + \frac{c\pi(\log\lfloor\frac{M}{4}\rfloor+1)}{\rho}+\frac{c\pi^{2}}{6\rho^{2}} \!\right)^{2} \right]^{-1}. \end{array} $$

This upper bound also bounds the maximum in (4.2) since for all τ ≤ 1/2 and ρ ≥ 2 together with Theorem 2.1

$$ \left\Vert K_{1}^{-1}\right\Vert\le \frac{2}{N} \le \frac{1}{2N \tau^{2}} \le \frac{1}{N\tau^{2}} [2- \cdots]^{-1}. $$

Theorem 4.7 (Upper bound)

Under the conditions of Definition 4.1 with M ≥ 4, \(\tau \le \tau _{max} =\frac {1}{4c^{2}}\) and \(\rho \ge \rho _{\min \limits } =10c^{2}(\log \lfloor \frac {M}{4} \rfloor +1)\), we have:

$$ \textnormal{cond}(A) \le \frac{5}{\tau}. $$

Proof

In Lemma 4.6, the constant C(τ,ρ,c,M) is monotone increasing in τ and monotone decreasing in ρ. Hence, after plugging in the bounds for τ and ρ in our assumptions, it is easy to see that the constant \(C(\frac {1}{4c^{2}},10c^{2}(\log \lfloor \frac {M}{4} \rfloor +1),c,M)\) is monotone decreasing in c and M, respectively. Therefore, we get C(τ,ρ,c,M) ≤ C(1/4,10,1,4) ≤ 11.3, so that \(\left \Vert {K^{-1}}\right \Vert \le 11.3N^{-1}\tau ^{-2}\). Together with the bound \(\left \Vert {K}\right \Vert \le 22N/10=2.2N\) from Lemma 4.3, we obtain the result. □

If each pair of nearly colliding nodes has the same separation distance, i.e., c = 1, we can improve the upper bound in the sense that restrictions on τ except for τ ≤ 1 can be dropped. In order to obtain the same constant, we have to increase the restrictions on ρ slightly.

Lemma 4.8

Under the conditions of Definition 4.1 with c = 1, such that:

$$ \begin{array}{@{}rcl@{}} \tilde C(\rho,M) &:=& 2 - \frac{\pi^{2}(\log\lfloor\frac{M}{4}\rfloor + 1)}{\rho} - \frac{\pi^{3}}{3\rho^{2}}- \frac{2.42}{\rho^{3}}\\ && -\frac{\rho}{\rho-1} - \frac{2\pi(\log\lfloor\frac{M}{4}\rfloor+1)}{(\rho-1)}-\frac{\pi^{2}}{3\rho(\rho-1)}\\ &&- \frac{\pi^{2}(\log\lfloor\frac{M}{4}\rfloor+1)^{2}}{\rho(\rho-1)} -\frac{\pi^{3}(\log\lfloor\frac{M}{4}\rfloor+1)}{3\rho^{2}(\rho-1)}- \frac{\pi^{4}}{36\rho^{3}(\rho-1)} \end{array} $$

is positive, we have

$$ \left\Vert K^{-1}\right\Vert\le \frac{C(\rho,M)}{N\tau^{2}}, $$

where \(C(\rho ,M):= \left ( \frac {2\rho }{\rho -1}+\sqrt {\frac {\rho +1}{\rho -1}} \right ) / \tilde C(\rho ,M)\).

Proof

The proof is analogous to that of Lemma 4.6, the only difference is in step (iv). Setting c = 1 in (ii) and (iii), expanding the squared bracket in (iii) and inserting this into (4.3) leads to:

$$ \begin{array}{@{}rcl@{}} &&\left\Vert (K_{2} - BK_{1}^{-1}B^{*})^{-1}\right\Vert \le \left[ 2\left( N- {D_n\left( \tau/N\right)} \right)\right.\\ &&-N\tau^{2} \left( \frac{\pi^{2}(\log\lfloor\frac{M}{4}\rfloor+1)}{\rho}+\frac{\pi^{3}}{3\rho^{2}}+\frac{2.42}{\rho^{3}}\right) - \frac{\rho}{N(\rho-1)} \left( N- {D_n\left( \tau/N\right)} \right)^{2} \\ &&- \frac{\rho}{\rho-1} 2\tau\left( N- {D_n\left( \tau/N\right)} \right) \left( \frac{\pi(\log\lfloor\frac{M}{4}\rfloor+1)}{\rho}+\frac{\pi^{2}}{6\rho^{2}}\right)\\ &&\left.- N\tau^{2}\frac{\rho}{\rho-1} \left( \frac{\pi^{2}(\log\lfloor\frac{M}{4}\rfloor+1)^{2}}{\rho^{2}} +\frac{\pi^{3}(\log\lfloor\frac{M}{4}\rfloor+1)}{3\rho^{3}}+ \frac{\pi^{4}}{36\rho^{4}}\right) \right]^{-1}. \end{array} $$

In three summands, we can factor out \(N- {D_n\left (\tau /N\right )} \) and use the estimate NDn(τ/N) ≥ Nτ2, leading to a larger bound after inverting the expression in the end. Afterwards, in the third summand \(N- {D_n\left (\tau /N\right )} \) is left, for which we use the rough bound NDn(τ/N) ≤ N. In the fourth summand, we use τ ≤ 1 for the single τ. The same argument as in (3.4) shows that this also bounds the maximum in (4.2) and we get the result. □

Theorem 4.9 (Upper bound)

Under the conditions of Definition 4.1 with c = 1, \(\rho \ge \rho _{\min \limits } = 25 (\log \lfloor \frac {M}{4} \rfloor +1)\), we have:

$$ \textnormal{cond}(A) < \frac{5}{\tau}. $$

Proof

Direct inspection gives monotonicity of C(ρ,M) with respect to ρ and also the estimate \(C(25 (\log \lfloor {M}/{4} \rfloor +1),M)\le C(25,4) \le 12\). Hence, \(\left \Vert {K^{-1}}\right \Vert \le {12} N^{-1} \tau ^{-2}\) and together with the bound \(\left \Vert K\right \Vert \le 52N/25\) from Lemma 4.3 we obtain the result. □

Remark 4.10

Due to Lemma A.5, the upper bound from Theorem 4.7 remains valid if nodes are removed. We note in passing that \(\sigma _{\min \limits }\) and \(\sigma _{\max \limits }\) are monotone increasing with N and thus, condition number estimates for an even number N follow. Lower and upper bounds in Theorems 2.2 and 4.7 finally yield:

$$ \frac{1}{\tau}\le\text{cond}(A)\le \frac{5}{\tau} . $$

The lower bound is tight and the numerical value 5 in the upper bound follows from our proof technique and can be improved (see Fig. 6). The uniformity condition τ ≤ 1/(4c2) is artificial and, except for the special cases in Theorems 3.6 and 4.9, prevents letting τ → 1.

Fig. 6
figure 6

Numerical experiments for bounds on the condition number, lower bounds from Theorem 2.2; left, one nearly colliding pair, upper bound from Theorem 3.6; right, pairs of nearly colliding nodes, upper bound from Theorem 4.7

Moreover, the technical condition \(\rho \ge \rho _{\min \limits } = 25 (\log \lfloor \frac {M}{4} \rfloor +1)\) in Theorem 4.7 is due to the slow decay of the Dirichlet kernel and can be weakened by a preconditioning technique which however leads to a somewhat larger constant in the final result.Footnote 1 The diagonal matrix \(D=\text {diag}(1-{\left | k \right |}/{(n+1)})_{\left | k \right |\le n} \in {\mathbb {C}}^{N\times N}\) is positive definite with ∥D∥≤ 1 and thus the Rayleigh-Riesz characterization of the smallest eigenvalue for Hermitian matrices leads to:

$$ \begin{array}{@{}rcl@{}} \lambda_{\min}(ADA^{*}) &=&\min_{x\in{\mathbb{C}}^{M}, \left\Vert x\right\Vert=1} \left\Vert D^{1/2}A^{*} x\right\Vert^{2} \le \min_{x\in\mathbb{C}^{M}, \left\Vert {x}\right\Vert =1} \left\Vert {A^{*} x}\right\Vert ^{2} = \lambda_{\min}(K). \end{array} $$

The entries of the matrix ADA consist of Fejér kernel evaluations and analogously to Lemmata 4.4, 4.5, and 4.6 this yields (independently of M):

$$ \begin{array}{@{}rcl@{}} \left\Vert K^{-1}\right\Vert \!\!&\le&\!\! \frac{C(\tau,\rho,c)}{N\tau^{2}},\quad C(\tau,\rho,c)\!:=\! \left( \frac{6\rho^{2}}{3\rho^{2}-\pi^{2}}+\sqrt{\frac{3\rho^{2}+\pi^{2}}{3\rho^{2}-\pi^{2}}} \right)/\tilde C(\tau,\rho,c),\\ \tilde C(\tau,\rho,c) \!&:=&\! \frac{1}{8} \left( 2 - \frac{4c^{2}\pi^{4}}{3\rho^{2}} - \frac{39 c^{2}\pi }{\rho^{3}} - \frac{16c^{2}\pi^{4}+2c^{2}\pi^{6}}{45\rho^{4}} \right.\\ &&\left.\qquad-\frac{3\rho^{2}}{3\rho^{2}-\pi^{2}} \left( \frac{2c^{2}\pi^{2}}{9} \tau + \frac{2c\pi^{3}}{3\rho^{2}} + \frac{9.68 c}{\rho^{3}}\right)^{2} \right). \end{array} $$

Under the conditions of Definition 4.1 with n ≥ 1, \(\tau \le \tau _{max} =\frac {1}{4c^{2}}\) and \(\rho \ge \rho _{\min \limits } =11c^{2}\), we finally have cond(A) ≤ 14/τ. Note that this approach also allows to drop the logarithmic factor in Remark 3.8 (ii).

The absolute constant 5 in the upper bound of the condition number (or \(\tau \sqrt {N}\|A^{\dagger }\| \le \sqrt {11.3}\approx 3.4\)) follows from our proof technique and we give a numerical comparison to the approaches [3, 4, 8, 16] in Fig. 7. A short theoretical comparison including different assumptions on N, M, τ, and ρ is given below.

Fig. 7
figure 7

Upper bounds for \(\left \Vert A^{\dagger }\right \Vert \). Left, Comparison of Thm. 4.7 with [4, Cor. 3.6]; right, comparison of Thm. 4.7 with [16, Thm. 2], [8, Cor. 4.2] and its modification for our setting as mentioned in Remark 4.13

Remark 4.11 (Comparison to 4)

This approach is more general and allows more than two nodes in a group nearly colliding. The upper bound on the condition number grows quite strongly like CMM with the total number of nodes M, cf. [4, Cor. 3.6]. Moreover, the a priori conditions N ≥ 4M3 and ρ ≥ 2M (see [4, Cor. 3.6, left ineq. (3.4)]) are much stronger than ours. The uniformity condition τM/(2c) is slightly weaker than ours but nonetheless artificial.

Remark 4.12 (Comparison to 16)

Again, this approach is more general and allows more than two nodes in a group nearly colliding. The upper bound on the condition number grows like \(C\sqrt {M}\) with the total number of nodes M, cf. [16, Thm. 1 ineq. (2.3), Thm. 2 ineq. (2.7), and ineq. (2.8)]. With minor simplifications, the a priori conditions NM2 and:

$$ \begin{array}{@{}rcl@{}} \rho\ge \begin{cases} C_{1} \left( \frac{M}{\tau}\right)^{1/4}, & \text{with}~ C_{1}\approx 42,\\ C_{2} \left( \frac{M}{\tau}\right)^{1/2}, & \text{with}~ C_{2}\approx 63, \end{cases} \end{array} $$
(4.4)

are imposed; see [16, Thm. 1, ineq. (2.2), Thm 2, ineq. (2.5)]. We note that [16, Thm. 1, Thm. 2] places no upper bound except τ ≤ 1 but that condition (4.4) is in fact an a priori lower bound on τ which prevents τ → 0 already for moderate fixed M ≥ 3. Recently, we refined this approach in [13], dropped the mentioned dependencies on M and could weaken the condition (4.4) considerably.

Remark 4.13 (Comparison to 8)

This approach deals with pairs of nearly colliding nodes but differs completely from ours and the ones in [3, 4, 16], and rather generalizes the construction of certain extremal functions in [18] to pairs of nearly colliding nodes and subsets of them. The proven constant in the upper bound given in [8, Cor. 4.2] is \(\tau \sqrt {N}\|A^{\dagger }\|\le 9\sqrt {6}/\pi \approx 7.0\) and thus is slightly larger than ours (\(\sqrt {11.3}\approx 3.4\)). Using the stronger assumption on τ from our setting in the proof of [8, Thm. 3.6] and improving estimates in [8, Eq. (8)] provides the best result (≈ 1.7) for pairs of nearly colliding nodes. The conditions τ ≤ 1 and 3 ≤ ρ are quasi-optimal. Provided all technical results prove right, this approach is superior.

Remark 4.14 (Comparison to 3)

This approach uses a QR-decomposition technique to establish bounds on all singular values of Vandermonde matrices with nearly colliding nodes. Adapted to the case of pairs of nearly colliding nodes, we obtain the following: Let M ≥ 4 even and A as in Definition 4.1. With respect to the nearly colliding pairs, partition \(A^{*}=(A_{1}^{*} A_{2}^{*} \hdots A^{*}_{M/2})\) with QR decompositions \(A_{j}^{*}=Q_{j} R_{j}\) and set Q := (Q1Q2QM/2). Tracing back all constants in lemmata and proofs for the case of pairwise nearly colliding nodes, we obtain the uniform off-diagonal estimate:

$$ \begin{array}{@{}rcl@{}} \left| (Q^{*} Q)_{j,k} \right|&\le {150}/{\rho}+1079\tau, \qquad j\ne k, \end{array} $$

which yields a constant “multiplicative perturbation” in [3, Lem. 5.1] and thus a condition number estimate like Theorem 4.7 or [8] only if τC1/M and C2Mρ, for some constants C1,C2.

However, note that for two nearly colliding pairs u1 < u2v1 < v2, a direct computation (avoiding a so-called limit basis used in [3]) yields the off-diagonal estimate:

$$ \|Q_{1}^{*} Q_{2}\|_{\mathrm{F}}\le \frac{116}{N(v_{1}-u_{2})}, \qquad Q_{1}=Q_{1}(u_{1},u_{2}), Q_{2}=Q_{2}(v_{1},v_{2}). $$

Together with \(\rho \ge \frac {27}{23}\cdot 232(\log \left \lfloor \frac {M}{4}\right \rfloor +1)\) and Lemma A.6, this gives:

$$ \begin{array}{@{}rcl@{}} \left|1-\lambda_{r}(Q^{*}Q)\right| &\le& \underset{j}{\max} \sum\limits_{\ell=1}^{M/2} \|Q_{j}^{*} Q_{\ell}\|_{\mathrm{F}} \le 2 \sum\limits_{\ell=1}^{\left\lfloor{M/4}\right\rfloor} \frac{116}{\ell\rho}\\& \le& \frac{232(\log\left\lfloor\frac{M}{4}\right\rfloor+1)}{\rho} \le \frac{23}{27},\quad r=1,\dots,M. \end{array} $$

The Courant-Fisher min-max theorem [11, Thm. 4.2.6] and Weyl’s perturbation theorem [11, Thm. 4.3.1] finally yield:

$$ \text{cond}(A)\le \text{cond}(Q) \cdot \underset{j}{\max}\text{cond}(A_{j}) \le \frac{5}{\tau}. $$

Altogether, the improved variant of this technique can be used for nearly colliding pairs, but leads to a stronger assumption on ρ for all moderate uniformity constants c.

5 Numerical examples

All computations were carried out using MATLAB R2019b. As a test for the bounds in the case of one pair of nearly colliding nodes, we use the following configuration. Let the number of nodes M = 20 and M = 200 be fixed, respectively. Moreover, we choose N = 1 + 12(M − 1) which ensures that all nodes fit on the unit interval. We choose \(\tau \in \left [ {10^{-11},1} \right ]\) logarithmically uniformly at random and \(\rho _{3},\dots ,\rho _{M} \in \left [ 6,12 \right ]\) uniformly at random. Then, we set the nodes \(t_{1}<\cdots <t_{M} \in \left [ {0,1} \right )\) such that t1 = 0, t2 = τ/N and for \(j=3,\dots ,M\), \(\left | t_{j}-t_{j-1} \right |=\rho _{j}/N\). Afterwards, the condition number of the corresponding Vandermonde matrix is computed. This procedure is repeated 100 times and the results are presented in Fig. 6 (left).

For pairs of nearly colliding nodes, we use the following configuration. Let the number of nodes M = 20 and M = 200 be fixed, respectively. Moreover, we choose the parameter c = 2 and τmax and \(\rho _{\min \limits }\) as in Theorem 4.7. To ensure that all nodes fit on the unit interval, we choose N as the smallest odd integer bigger than \((c\tau _{max}+2\rho _{\min \limits })M/2\). Then, we choose \(\tau \in \left [ 10^{-11},1 \right ]\) logarithmically uniformly at random and set the nodes \(t_{1}< {\cdots } < t_{M} \in \left [ {0,1} \right )\) such that t1 = 0, t2 = τ/N and for \(j=3,\dots ,M\), \(\left | t_{j}-t_{j-1} \right |=\rho _{j}/N\) if j is odd or \(\left | {t_{j}-t_{j-1}} \right | =\tau _{j}/N\) if j is even, where \(\tau _{j}\in \left [ \tau ,c\tau \right ]\) and \(\rho _{j} \in \left [ \rho _{\min \limits },2\rho _{\min \limits } \right ]\) are picked uniformly at random, respectively. Afterwards, the condition number of the corresponding Vandermonde matrix is computed. This procedure is repeated 100 times and the results are presented in Fig. 6 (right). Note that Theorem 4.7 makes the restriction \(\tau \le \tau _{max}=\frac {1}{4}\), which seems to be an artifact of our proof technique.

In order to compare Theorem 4.7 with the results from [4, Cor. 3.6], we need to satisfy the assumptions of both results. We take M = 3 nodes with two nodes nearly colliding, i.e., t1 = 0, t1 = τ/N and t2 = t1 + ρ/N. The assumptions in [4, Cor. 3.6] make it necessary that the nodes lie on an interval of length \(\frac {1}{2M^{2}}=\frac {1}{18}\). We choose the parameter c = 1, \(\rho _{\min \limits } =12\), and N = 1001. Then, we pick \(\tau \in \left [ {10^{-11},1} \right ] \) logarithmically uniformly at random and \(\rho \in \left [ \rho _{\min \limits },\frac {N}{2M^{2}}-\tau \right ]\) uniformly at random. Afterwards, the inverse of the smallest singular value (norm of Moore-Penrose pseudo inverse) of the corresponding Vandermonde matrix is computed. This procedure is repeated 100 times and the results normalized by \(\tau \sqrt {N}\) are presented in Fig. 7 (left). From [4, Cor. 3.6], we get:

$$ \left\Vert A^{\dagger}\right\Vert \le \frac{2(2\pi)^{M-1} M^{2M-1}}{\pi}\cdot \frac{N\sqrt{N}}{(N-1)\sqrt{N-1}}\cdot\frac{1}{\tau\sqrt{N}} \approx 6116 \cdot \frac{1}{\tau\sqrt{N}} $$

for τ ≤ 1, whereas Theorem 4.7 provides \(\left \Vert A^{\dagger }\right \Vert \le \sqrt {11.3}\cdot \frac {1}{\tau \sqrt {N}} \approx 3.4 \cdot \frac {1}{\tau \sqrt {N}}\) for \(\tau \le \frac {1}{4}\).

In order to compare our results with the ones from the second version of [16, Thm. 1, Thm. 2], we set the parameter N = 215 + 1, c = 1 and M = 4 and M = 20, respectively. All pairs of nodes are placed uniformly, such that \(t_{j}=\frac {2j-2}{M}\) and \(t_{j+M/2}=t_{j}+\frac {\tau }{N}\) for \(j=1,\hdots ,\frac {M}{2}\), where τ is picked logarithmically uniformly at random from [10− 11,1]. Afterwards, the inverse of the smallest singular value (norm of Moore-Penrose pseudo inverse) of the corresponding Vandermonde matrix is computed. This procedure is repeated 100 times and the results normalized by \(\tau \sqrt {N}\) are presented in Fig. 7 (right). Note that [16, Thm. 1, ineq. (2.2), Thm. 2, ineq. (2.5)] restricts:

$$ \begin{array}{@{}rcl@{}} \tau \!\ge\! \frac{20^{2} M 2^{5} N^{3}}{\rho^{2} (N-1)^{3}} \!\approx\! \left\{\begin{array}{cc} 1.9\cdot 10^{-4},\\ 2.4 \cdot 10^{-2}, \end{array}\right.\qquad \tau \!\ge\! \frac{10^{4} 2^{10} M N^{5}}{\rho^{4} \pi(N-1)^{5}} \!\approx\! \left\{\begin{array}{cc} 1.8\cdot 10^{-10},\quad &M=4,\\ 5.6 \cdot 10^{-7},\quad &M=20, \end{array}\right. \end{array} $$

respectively, where we used the uniform bound \(\rho < \frac {2N}{M}\). The results are shown in Fig. 7 (right) by proper lines [16, Thm. 2, ineq. (2.5)] and by broken lines [16, Thm. 1, ineq. (2.2)]. In both cases and with minor corrections, the resulting estimate is:

$$ \begin{array}{@{}rcl@{}} \left\Vert A^{\dagger}\right\Vert &\le& \frac{20\sqrt{2}}{19} \left( 1-\frac{\pi^{2}}{12}\right)^{-1/2} \frac{N-1}{2}\left\lfloor{\frac{N-1}{2}}\right\rfloor ^{-1}\frac{4}{\pi} \sqrt{M} \frac{\sqrt{N}}{\sqrt{N-1}}\cdot \frac{1}{\tau\sqrt{N}}\\ &\approx& \begin{cases} 9 \cdot \frac{1}{\tau\sqrt{N}},\quad & M=4,\\ 20.1 \cdot \frac{1}{\tau\sqrt{N}},\quad &M=20, \end{cases} \end{array} $$

whereas Theorem 4.7 provides again \(\left \Vert A^{\dagger }\right \Vert \le 3.4 \cdot \frac {1}{\tau \sqrt {N}}\) for \(\tau \le \frac {1}{4}\). We note that our bound remains valid for c > 1 but the restriction on τ becomes more severe.

6 Summary

We proved upper and lower bounds for the spectral condition number of rectangular Vandermonde matrices with nodes on the complex unit circle. If pairs of nodes nearly collide, the studied condition number grows linearly with the inverse separation distance. In contrast to the more general results [4, 16], we provide reasonable sharp and absolute constants but have to admit that our technique most likely will not generalize to more than two nodes nearly colliding. Note that our easy to achieve lower bound seems to capture the situation more accurately than the upper bound. We posed mild technical conditions in our proofs, which cannot be confirmed to be necessary numerically. While [4] provided the right growth order for the first time, some of the imposed conditions are very restrictive and the involved constants are quite pessimistic. The second version of [16] provided a quite general framework and presented decent results with only a mild artificial growth of the condition number with respect to the number of nodes. Moreover, a technical condition there prevents the separation distance from going to zero for a fixed number of nodes and a fixed bandwidth. We believe that both problems can be fixed at least partially and thus [16] seems to be a good framework for understanding node configurations with nearly colliding nodes. Recently, the manuscript [8] came to our attention—it considers pairs of nearly colliding nodes and weakens the assumptions considerably and gives, after modifications, stronger bounds on the smallest singular value. The taken approach differs completely from ours and the ones in [4, 16], but rather generalizes the construction of [18] to pairs of nearly colliding nodes.