1 Introduction

Spherical approximation is a topic of immense interest and the use of positive definite spherical basis functions has recently been discussed in a tremendous number of publications [8, 14, 16, 23]. Most of the known results are for isotropic positive definite kernels, which are kernels that only depend on the geodesic distance of their arguments. Isotropic kernels are the generalisation of the radial basis function method to the sphere and therefore referred to in approximation theory as spherical radial basis functions (see [16] and references therein). They are also of importance in statistics where they occur as correlation functions of homogeneous random fields on spheres [18, 19].

In Euclidean space the study of general (non radial) kernel methods has gained importance, see for example [13]. Our paper aims to provide a starting point for such general kernel methods on spheres by stating a general series representation of kernels on the sphere and first establishing connections between geometric properties of the kernel (like isotropy or axial symmetry) and the series representation. Then positive definiteness and strict positive definiteness of such kernels are studied.

We summarise the existing results on kernel methods, these mainly focus on the 2-sphere. Recently, the use of (non-isotropic) kernels on the 2-sphere was suggested and applied for the approximation of global data in [5, 11, 12, 24], the latter three papers using non isotropic kernels on the sphere or the sphere cross-time. Conditions that ensure strict positive definiteness of axially-symmetric kernels on the 2-sphere were proven in [6]. These results are in this paper generalised to d-dimensional spheres and additional necessary conditions are proven.

The second type of non-radial kernels we study are convolutional kernels which have for example been described in [7, 25], where the second reference includes conditions for strict positive definiteness similar to the ones we will present but only for the case of the two-sphere.

The new results we present refer to higher-dimensional spheres, too, these have to be studied separately since the basis functions used for the description of the kernels have a more complicated structure in higher dimensions. Also we simplify the technique of the proof for the convolutional kernels by not using a fixed basis of the spherical harmonics.

The results for convolutional kernels are closely connected to the ones known for isotropic kernels and easier to verify than the abstract results for axially symmetric kernels. However, the convolutional kernels contain a subclass of the axially symmetric kernels which allows the use of the more applicable results for the construction of axially symmetric kernels.

We will briefly summarise the needed definitions in the first section and then, starting from a general form of the kernel, state conditions which ensure certain properties of the kernel, like axial symmetry, convolutional form and invariance under parity. In the third section, we give sufficient conditions for axially symmetric kernels to be positive definite and strictly positive definite and add some necessary conditions. In the fourth part of this paper we generalise the result of Chen et al. [8] on radial kernels to convolutional kernels deriving sufficient conditions on strict positive definiteness and finally study the special case of the circle in the last section.

1.1 Problem Description and Background

We focus on interpolation problems on the d-sphere

$$\begin{aligned} {{\mathbb {S}}^{d-1}}=\left\{ \xi \in {\mathbb {R}}^d \,\bigg | \ \xi _1^2+\xi _2^2 +\cdots +\xi _d^2=1 \right\} ,\quad d\ge 2, \end{aligned}$$

where a finite set of distinct data sites \(\varXi \subset {{\mathbb {S}}^{d-1}}\) and values \(f\left( \xi \right) \in {\mathbb {C}}\), \(\xi \in \varXi \), of a possibly elsewhere unknown function f on the sphere are given.

The approximant is formed as a linear combination of kernels

$$\begin{aligned} K:{{\mathbb {S}}^{d-1}}\times {{\mathbb {S}}^{d-1}}\rightarrow {\mathbb {C}}. \end{aligned}$$

Taking the form

$$\begin{aligned} s_f\left( \zeta \right) =\sum _{\xi \in \varXi } c_{\xi } K \left( \zeta , \xi \right) , \qquad \zeta \in {{\mathbb {S}}^{d-1}}, \end{aligned}$$
(1)

the problem of finding such an approximant \(s_f\) satisfying

$$\begin{aligned} s_f\left( \xi \right) =f\left( \xi \right) , \qquad \forall \xi \in \varXi , \end{aligned}$$
(2)

is uniquely solvable under certain conditions on K. We assume all the kernels to be Hermitian, meaning they satisfy \(K\left( \xi ,\zeta \right) =\overline{K\left( \zeta , \xi \right) }\), so that the positive definiteness of the kernel will ensure the solvability of the interpolation problem for arbitrary data sets.

Definition 1

A Hermitian kernel \(K: \varOmega \times \varOmega \rightarrow {\mathbb {C}}\) is called positive definite on \(\varOmega \) if the matrix \(K_{\varXi }=\left\{ K\left( \xi ,\zeta \right) \right\} _{\xi , \zeta \in \varXi }\) is positive semi-definite on \({\mathbb {C}}^{\vert \varXi \vert }\) for arbitrary finite sets of distinct points \(\varXi \subset \varOmega \).

The kernel is strictly positive definite if \(K_{\varXi }\) is a positive definite matrix on \({\mathbb {C}}^{\vert \varXi \vert }\) for arbitrary finite sets of distinct points \(\varXi \).

We assume that K is square-integrable and can be represented as a series

$$\begin{aligned} K\left( \xi , \zeta \right) =\sum _{j,j'=0}^{\infty } \sum _{k=1}^{N_{j,d}} \sum _{k'=1}^{N_{j',d}} a_{j,j',k,k'}Y_j^k\left( \xi \right) \overline{Y_{j'}^{k'} \left( \zeta \right) },\qquad \forall \xi ,\zeta \in {{\mathbb {S}}^{d-1}}, \end{aligned}$$
(3)

where the \(Y_j^k\) form an orthonormal basis of the eigenfunctions of the Laplace–Beltrami operator on the sphere and the corresponding eigenvalues increase with j. The \(a_{j,j',k,k'}\) are complex numbers and square-summable as a consequence of the square-integrability of K. The eigenfunctions are the solutions of the eigenfunction problem

$$\begin{aligned} \lambda f +\triangle f=0, \end{aligned}$$

where \(\triangle \) is the Laplace–Beltrami operator on the sphere. For example on the 2-sphere such a basis can take the form

$$\begin{aligned} Y_j^k\left( \xi \right) = \frac{1}{\sqrt{2\pi }} \sqrt{\frac{2j+1}{2} \frac{\left( j-k\right) !}{\left( j+k\right) !}} P_{j}^k\left( \cos \left( \theta \right) \right) e^{ik \varphi }, \end{aligned}$$
(4)

where \((\theta ,\varphi )\in [0,2\pi )\times [0,\pi )\) are the polar coordinates representing \(\xi \), and \(P_{j}^k\) are the associated Legendre polynomials (see for example 8.1.2, [1]).

The eigenfunctions corresponding to the eigenvalues \(\lambda _{j}=j\left( j+d-1\right) \) are spherical harmonics and the number of eigenfunctions corresponding to the eigenvalue \(\lambda _j\) is denoted by \(N_{j,d}\). The numbers are given by \(N_{0,d}=1\),

$$\begin{aligned} N_{j,d}=\frac{\left( 2j+d-2\right) \left( j+d-3\right) !}{j!\left( d-2\right) !}. \end{aligned}$$

We will denote the space of all spherical harmonics corresponding to the eigenvalue \(\lambda _j\) by \(H_j:={\text{ span }} \{ Y_{j}^k \mid k=1,\ldots ,N_{j,d}\}\) Additionally we will use the following estimate which holds for any orthonormal basis and is a direct consequence of the Poisson summation formula ([3], Equation (2.35))

$$\begin{aligned} \vert Y_j^{k}\left( \xi \right) \vert \le \sqrt{\frac{N_{j,d}}{\sigma _{d-1}}}, \quad \forall \xi \in {{\mathbb {S}}^{d-1}}, \end{aligned}$$
(5)

where \(\sigma _{d-1}\) is the surface area of \({{\mathbb {S}}^{d-1}}\).

It is well known and used in the characterisation above that every function in \(L^2\left( {{\mathbb {S}}^{d-1}}\right) \) can be represented as a spherical harmonic expansion of the form

$$\begin{aligned} g\left( \xi \right) = \sum _{j=0}^{\infty }\sum _{k=1}^{N_{j,d}} \hat{g}_{j,k}Y_{j}^k\left( \xi \right) ,\qquad \text{ with } \hat{g}_{j,k}=\int _{{{\mathbb {S}}^{d-1}}}g\left( \zeta \right) \overline{Y_{j}^k \left( \zeta \right) }\,d \sigma \left( \zeta \right) , \end{aligned}$$
(6)

where \(d\sigma \) is the surface area measure on \({{\mathbb {S}}^{d-1}}\). For this expansion the Parseval equation for spherical harmonics holds ([3], (2.143)):

$$\begin{aligned} \Vert g\Vert _{L^2\left( {{\mathbb {S}}^{d-1}}\right) }^2=\sum _{j=0}^{\infty } \sum _{k=1}^{N_{j,d}} \vert \hat{g}_{j,k}\vert ^2. \end{aligned}$$

2 Kernels with Special Coefficient Structure

We now fix a basis for our space of spherical harmonics such that the kernels which are still Hermitian and square-integrable are represented as in (3). We demonstrate in this section how geometric properties of the kernel and the coefficient structure of the series representation are connected. Thereby imposing certain conditions on the structure of the coefficients allows us to focus on kernels with specific properties in the next sections.

The case studied most often is assuming that the kernel is isotropic [26], which means it only depends on the distance of its two arguments and not on their position on the sphere. The coefficients of an isotropic positive definite kernel given in the form (3) satisfy

$$\begin{aligned} a_{j,j',k,k'}=c_j \delta _{j',j} \delta _{k,k'}\ge 0 \end{aligned}$$

as stated by Schoenberg [26]. A characterisation of the strictly positive definite kernels of this form was presented by Chen et al. in [8].

Recently, a sufficient condition for axially symmetric kernels on the two-sphere to be strictly positive definite was presented in [6]. The axially symmetric kernels on \({\mathbb {S}}^{2}\) do depend on the difference in longitude of the two inputs \(\xi ,\zeta \) and their individual values of latitude. The coefficients of these kernels satisfy

$$\begin{aligned} a_{j,j',k,k'}=c_k\left( j,j'\right) \delta _{k,k'}, \end{aligned}$$

where the basis of the spherical harmonics on the 2-sphere is chosen as (4). The condition was first stated in [17].

For the study of properties of the kernel we need the kernel of the form (3) to be uniquely recoverable from its Fourier expansion on \({{\mathbb {S}}^{d-1}}\times {{\mathbb {S}}^{d-1}}\) we also require the series to be absolutely summable. Therefore we start by establishing that the latter implies the former.

Lemma 1

Let K be a kernel of the form (3). If

$$\begin{aligned} \sum _{j,j'=0}^{\infty }\sum _{k,k'=1}^{N_{j,d},N_{j',d}} \vert a_{j,j',k,k'}\vert \sqrt{\frac{N_{j,d}N_{j',d}}{\sigma _{d-1}^2}} <\infty , \end{aligned}$$
(7)

then, for each combination of \(j,j'\in \mathbb {N}_0\), \(1\le k \le N_{j,d}, 1\le k'\le N_{j',d}\), the coefficients \(a_{j,j',k,k'}\) can be uniquely determined by

$$\begin{aligned} a_{j,j',k,k'}=\int _{{{\mathbb {S}}^{d-1}}}\int _{{{\mathbb {S}}^{d-1}}}K\left( \xi ,\zeta \right) Y_j^k\left( \xi \right) \overline{Y}_{j'}^{k'}\left( \zeta \right) \, d\sigma \left( \xi \right) \,d\sigma \left( \zeta \right) . \end{aligned}$$
(8)

Proof

The result follows by inserting the kernel representation (3) into (8) and then using the estimate (5) to exchange the order of summation and integration since it implies absolute summability. \(\square \)

We note that any kernel defined using coefficients satisfying property (7) will be continuous, as the upper bound is absolutely summable and all basis function are continuous. We will from now on refer to series kernels given in the form (3) which satisfy (7) simply as kernels with property (7).

First we note

Lemma 2

Let K be a kernel with property (7). It is Hermitian if and only if

$$\begin{aligned} a_{j,j',k,k'}=\overline{a_{j',j,k',k}} \end{aligned}$$
(9)

for all possible choices \(j,j'\in {\mathbb {N}}\) and \(1\le k \le N_{j,d}, 1\le k'\le N_{j',d}.\)

Proof

Assuming K is Hermitian implies that

$$\begin{aligned} \overline{a_{j',j,k',k}}=\int _{{{\mathbb {S}}^{d-1}}}\int _{{{\mathbb {S}}^{d-1}}} \overline{K\left( \xi ,\zeta \right) }\,\overline{Y}_{j'}^{k'} \left( \xi \right) {Y}_{j}^{k}\left( \zeta \right) \, d\sigma \left( \xi \right) \,d\sigma \left( \zeta \right) =a_{j,j',k,k'} \end{aligned}$$

according to (8). The other direction follows from representation (3). \(\square \)

Besides axially symmetric kernels we are interested in kernels which are referred to as convolutional kernels. For a fixed basis of spherical harmonics these are expressed as kernels with an eigenvalue block structure but we give a non basis dependent definition.

Lemma 3

A Hermitian kernel \(K\in L^2\left( {{\mathbb {S}}^{d-1}}\times {{\mathbb {S}}^{d-1}}\right) \) of the form (3) satisfies

$$\begin{aligned} \int _{{{\mathbb {S}}^{d-1}}}K\left( \xi ,\zeta \right) \overline{Y\left( \xi \right) }d \sigma (\xi ) \in H_j, \quad \forall \ Y\in H_j, \end{aligned}$$

if and only if

$$\begin{aligned} a_{j,j',k,k'}=\delta _{j,j'}d_j\left( k,k'\right) . \end{aligned}$$

Proof

It is sufficient to prove the implication for the basis \(Y_{j}^k\), \(k=1,\ldots ,N_{j,d}\), because of the linearity of the integral. For these

$$\begin{aligned} \int _{{{\mathbb {S}}^{d-1}}}K\left( \xi ,\zeta \right) \overline{Y_{j}^k\left( \xi \right) }d \sigma (\xi )=\sum _{j'=0}^{\infty } \sum _{k'=1}^{N_{j',d}}a_{j,j',k,k'} Y_{j'}^{k'}\left( \zeta \right) , \end{aligned}$$

which is an element of \(H_j\) if and only if \(a_{j,j',k,k'}=\delta _{j,j'}d_j\left( k,k'\right) \). \(\square \)

Kernels of this form, we call them convolutional kernels, are invariant under parity, meaning \(K\left( \xi ,\zeta \right) =K\left( -\xi ,-\zeta \right) \), and the special structure allows to determine easily if an interpolant derived using such a kernel is included in certain Sobolev spaces, as for example studied in [22].

The kernels have been mostly discussed without the selection of a fixed basis of the spherical harmonics of a certain order. In this case one can choose the basis of \(H_j\), denoted by \(\tilde{Y}_j^k\left( \xi \right) \) in a way that

$$\begin{aligned} K\left( \xi , \zeta \right) =\sum _{j=0}^{\infty } \sum _{k=1}^{N_{j,d}} d_{j,k} \tilde{Y}_j^k \left( \xi \right) \overline{\tilde{Y}_{j}^{k} \left( \zeta \right) },\qquad \forall \xi ,\zeta \in {{\mathbb {S}}^{d-1}}. \end{aligned}$$
(10)

Under the above conditions and for \(f\in L^2\left( {{\mathbb {S}}^{d-1}}\right) \) the convolution operator with kernel K is

$$\begin{aligned} Tf\left( \xi \right) =\int _{{{\mathbb {S}}^{d-1}}}K\left( \xi ,\zeta \right) f \left( \zeta \right) d\sigma \left( \zeta \right) , \end{aligned}$$

and the Fourier coefficients of Tf as in (6), when computed with respect to the basis \(\tilde{Y}_j^k,\) satisfy \(\widehat{\tilde{Tf}}_{j,k}=d_{j,k} \hat{\tilde{f}}_{j,k}\). We nevertheless include the slightly more complex block structure, since we believe it to be helpful when we want to construct new kernels using our results. For example we may be using the fixed basis of the spherical harmonics on the 2-sphere as mentioned.

For completeness we include the conditions on the coefficients which ensure invariance under parity. One can easily verify that all restrictions of shift-invariant, Hermitian kernels in \({\mathbb {R}}^d\) to the surface of the sphere are invariant under parity.

Lemma 4

(Parity-invariance) A kernel with property (7) satisfies,

$$\begin{aligned} K\left( \xi ,\zeta \right) =K\left( -\xi ,-\zeta \right) , \quad \forall \xi ,\zeta \in {{\mathbb {S}}^{d-1}}, \end{aligned}$$

if and only if

$$\begin{aligned} a_{j,j',k,k'}=0, \quad \forall j+j'\ne 0\mod 2. \end{aligned}$$
(11)

Proof

The result follows immediately using the property of homogenity of the spherical harmonics:

$$\begin{aligned} K\left( -\xi ,-\zeta \right) =\sum _{j=0}^{\infty }\sum _{k=1}^{N_{j,d}} \sum _{j'=0}^{\infty }\sum _{k'=1}^{N_{j,d}} a_{j,j',k,k'} \left( -1\right) ^{j+j'}\overline{Y_{j'}^{k'}} \left( \zeta \right) Y_{j}^{k}\left( \xi \right) , \quad \forall \xi ,\zeta \in {{\mathbb {S}}^{d-1}}. \end{aligned}$$

This is equal to \(K\left( \xi ,\zeta \right) \) if and only if all coefficients are equal, meaning \( a_{j,j',k'k'}=0\), for all pairs of \(j,j'\) with odd sum. \(\square \)

The previous conditions are independent of the choice of orthogonal basis \(Y_j^k\). For axial symmetry we will have to fix a certain basis. For the existing results for \(d=3\) this was (4) where axial symmetry with respect to \(\varphi \) led to the coefficients satisfying \(a_{j,j',k,k'}=\delta _{k,k'} c_{j,j'}\). On general spheres with \(d\ge 3\) we will focus on axial symmetry with respect to just one of the \(d-1\) axes and we assume that the sphere has been rotated such that this is the axis which is represented by the first coordinate in the polar coordinate representation of \(\xi \in {{\mathbb {S}}^{d-1}}\), which we denote by \(\left( \theta _1,\ldots ,\theta _{d-1}\right) ^T \in [0,2\pi )\times [0,\pi )^{d-2}\). The spherical harmonics of degree \(\ell _{d-1}\) can explicitly be given by

$$\begin{aligned} Y_{\ell _1,\ldots ,\ell _{d-1}}\left( \xi \right) =\frac{1}{\sqrt{2\pi }}e^{i\ell _1\theta _1} \prod _{j=2}^{d-1}{}_{j}\tilde{P}_{\ell _j}^{\ell _{j-1}} \left( \theta _{j}\right) , \end{aligned}$$
(12)

where \(\ell _1,\ldots ,\ell _{d-1}\) are integers satisfying

$$\begin{aligned} \ell _{d-1}\ge \cdots \ge \vert \ell _1\vert \end{aligned}$$

and

$$\begin{aligned} {}_{j}\tilde{P}_{L}^{\ell }(\theta )={}_{j}c_L^{\ell } \left( \sin \left( \theta \right) \right) ^{-\left( 2-j\right) /2} P_{L+\left( j-2\right) /2}^{-\left( \ell +\left( j-2\right) /2\right) } \left( \cos \left( \theta \right) \right) , \end{aligned}$$

where \(P_{\nu }^{\mu }\) are the associated Legendre functions and

$$\begin{aligned} {}_{j}c_L^{\ell }:= \left( \frac{2L+j-1}{2} \frac{\left( L+\ell +j-2\right) !}{\left( L-\ell \right) !}\right) ^{1/2}. \end{aligned}$$

The formula is taken from [15], Equation (2.5). To be able to handle the formula better we define the last part of the spherical harmonics as \(\ell :=\left( \ell _2,\ldots ,\ell _{d-2}, \ell _{d-1}\right) \), \(\theta ':=\left( \theta _2,\ldots , \theta _{d-1}\right) \) and

$$\begin{aligned} p_ {\ell _1,\ell }\left( \theta '\right) = \frac{1}{\sqrt{2\pi }} \prod _{j=2}^{d-1}{}_{j}\tilde{P}_{\ell _j}^{\ell _{j-1}} \left( \theta _{j}\right) . \end{aligned}$$

Further we set

$$\begin{aligned} \varLambda _{\vert \ell _1\vert }:=\left\{ \left( \ell _2,\ldots , \ell _{d-2},\ell _{d-1}\right) \in {\mathbb {N}}^{d-2} \big \vert \, \vert \ell _1\vert \le \ell _2 \le \cdots \le \ell _{d-2}\le \ell _{d-1} \right\} \end{aligned}$$

and will use \(Y_{\ell _1,\ell }\left( \xi \right) \), with \(\ell =\left( \ell _2,\ldots ,\ell _{d-1}\right) \in \varLambda _{\vert \ell _1\vert }\) as the abbreviation for \(Y_{\ell _1,\ell _2,\ldots ,\ell _{d-1}}\left( \xi \right) \), for \(a_{\ell _1,\ldots ,\ell _{d-1},\ell ' _1,\ldots ,\ell '_{d-1}}\) respectively. As in the definition of the spherical harmonics, we need to use the points on \({{\mathbb {S}}^{d-1}}\) and their polar coordinates simultaneously, therefore we note that \(\xi \) has the polar coordinate representation \(\left( \theta _1, \ldots ,\theta _{d-1}\right) \) and \(\zeta \) is given in polar coordinates as \(\left( \nu _1,\ldots ,\nu _{d-1}\right) \). We will consistently use this notation in the paper. Assuming a kernel has property (7) for absolute summability then it can be represented as:

$$\begin{aligned} K\left( \xi ,\zeta \right) =\frac{1}{2\pi }\sum _{\ell _{1}, \ell _{1}'=-\infty }^{\infty }\ \sum _{\ell \in \varLambda _{\vert \ell _1\vert }} \ \sum _{\ell '\in \varLambda _{\vert \ell _1'\vert }} a_{\ell _1,\ell ,\ell '_1,\ell '}e^{i(\ell _1\theta _1-\ell '_1\nu _1)} p_ {\ell _1,\ell }\left( \theta '\right) \left( \xi \right) \overline{p_ {\ell _1',\ell '}\left( \nu '\right) } , \end{aligned}$$
(13)

by reordering of the summands.

Theorem 1

(Axial symmetry with respect to one axis) A kernel with property (7), given in the form

$$\begin{aligned} K\left( \xi ,\zeta \right) =\sum _{\ell _{1},\ell _{1}'=-\infty }^{\infty } \sum _{\ell \in \varLambda _{\vert \ell _1\vert }} \ \sum _{\ell '\in \varLambda _{\vert \ell _1'\vert }} a_{\ell _1,\ell ,\ell '_1,\ell '}\ Y_{\ell _1,\ell } \left( \xi \right) \overline{Y_{\ell _1',\ell '}} \left( \zeta \right) ,\quad \xi ,\zeta \in {{\mathbb {S}}^{d-1}}, \end{aligned}$$

is axially symmetric with respect to the \(\theta _1\) axis if and only if

$$\begin{aligned} a_{\ell _1,\ell ,\ell _1',\ell '} =\delta _{\ell _1,\ell '_1}c_ {\ell _1}(\ell , \ell '). \end{aligned}$$
(14)

Proof

The rotation in the \(\theta _1\)-axis by the angle \(\alpha \) will be denoted as \(R_{\alpha }\). Then using (13):

$$\begin{aligned} K\left( R_{\alpha }\xi ,R_{\alpha }\zeta \right)&=\sum _{\ell _{1},\ell _{1}' =-\infty }^{\infty }\ \sum _{\ell \in \varLambda _{\vert \ell _1\vert }} \ \sum _{\ell '\in \varLambda _{\vert \ell _1'\vert }} a_{\ell _1,\ell , \ell '_1,\ell '}\frac{1}{2\pi }e^{i\ell _1\left( \theta _1+\alpha \right) -i\ell _1'\left( \nu _1+\alpha \right) } \nonumber \\&\quad \times p_ {\ell '_1,\ell }\left( \theta '\right) \left( \xi \right) \overline{p_ {\ell '_1,\ell '}\left( \nu '\right) }\nonumber \\&=\sum _{\ell _{1},\ell _{1}'=-\infty }^{\infty } e^{-i\alpha \left( \ell _1-\ell '_1\right) }\times \sum _{\ell \in \varLambda _{\vert \ell _1\vert }} \ \sum _{\ell '\in \varLambda _{\vert \ell _1'\vert }} a_{\ell _1,\ell ,\ell '_1,\ell '}Y_{\ell _1, \ell } \left( \xi \right) \overline{Y_{\ell _1',\ell }}\left( \zeta \right) .\nonumber \\ \end{aligned}$$
(15)

We see that the above condition is sufficient for axial symmetry. It is also necessary since according to Lemma 1 equality is only possible if the coefficients are all equal and this can only hold for all \(\alpha >0\) if \(a_{\ell _1,\ell , \ell _1',\ell '}= \delta _{\ell _1,\ell '_1}c_ {\ell _1}(\ell ,\ell ').\) \(\square \)

As expected, the axially symmetric kernels can be written as a function depending on the values \(\theta '\),\(\nu '\) and the longitudinal difference \(\theta _1-\nu _1\). In [11] additionally the properties of being longitudinal independent and longitudinal-reversible are introduced.

Definition 2

  • An axially symmetric kernel is called longitudinal reversible if there exists a map \(\tilde{K}:[0,2\pi )\times [0,\pi )^{d-2} \times [0,\pi )^{d-2}\rightarrow {\mathbb {C}}\) with

    $$\begin{aligned} K\left( \xi ,\zeta \right) =\tilde{K}\left( \vert \theta _1 -\nu _1\vert ,\theta ',\nu '\right) . \end{aligned}$$
  • An axially symmetric kernel is called longitudinal-independent if there exists a map \(\tilde{K}: [0,\pi )^{d-2} \times [0,\pi )^{d-2}\rightarrow {\mathbb {C}}\) with

    $$\begin{aligned} K\left( \xi ,\zeta \right) =\tilde{K}\left( \theta ',\nu '\right) . \end{aligned}$$

For the next lemma and further use in the next section we introduce another simplified notation. First we define for an axially symmetric Hermitian kernel

$$\begin{aligned} c_{\ell _1}: \varLambda _{\vert \ell _1\vert } \times \varLambda _{\vert \ell _1\vert } \rightarrow {\mathbb {C}}, \end{aligned}$$
(16)

where the values of the map are as in (14). We deduce from Theorem 1 that every Hermitian, continuous and axially symmetric kernel which satisfies (7) can therefore be written as

$$\begin{aligned} K\left( \xi , \zeta \right) =\sum _{\ell _1=-\infty }^{\infty } \sum _{\ell ,\ell ' \in \varLambda _{\vert \ell _1\vert }}c_{\ell _1} \left( \ell ,\ell '\right) Y_{\ell _1,\ell }\left( \xi \right) \overline{Y_{\ell _1,\ell '}\left( \zeta \right) }. \end{aligned}$$
(17)

Additionally, we find that these spherical harmonics can be represented as

$$\begin{aligned} Y_{\ell _1,\ell }\left( \xi \right) ={\left\{ \begin{array}{ll} e^{i\ell _1\theta _1} p_{\ell _1,\ell }\left( \theta '\right) ,&{}{} \ell _1\ge 0, \\ (-1)^{\ell _1}e^{i\ell _1\theta _1}p_{-\ell _1,\ell }\left( \theta '\right) ,&{}{} \ell _1<0, \end{array}\right. } \end{aligned}$$
(18)

where \(\theta '=\left( \theta _2,\ldots , \theta _{d-1}\right) \), \(p_{\ell _1,\ell }\) is real valued, and the representation follows from (12).

We briefly state sufficient conditions of the expansion coefficients for longitudinal reversibility and independence.

Lemma 5

  1. 1.

    An axially symmetric kernel satisfying (7) is longitudinal reversible if and only if

    $$\begin{aligned} c_{\ell _1}\left( \ell ,\ell '\right) =c_{-\ell _1}\left( \ell ,\ell '\right) , \quad \forall \ell _1\in {\mathbb {Z}}, \ \ell ,\ell '\in \varLambda _{\vert \ell _1\vert }. \end{aligned}$$
  2. 2.

    An axially symmetric kernel satisfying (7) is longitudinal independent if and only if \(c_{\ell _1}\left( \ell ,\ell '\right) =0\) for all \(\ell _1\ne 0.\)

  3. 3.

    An axially symmetric kernel satisfying (7) with the above expansion is real valued if and only if

    $$\begin{aligned} c_{\ell _1}\left( \ell ,\ell '\right) =\overline{c_{-\ell _1} \left( \ell ,\ell '\right) },\quad \forall \ell _1\in {\mathbb {Z}}, \ \ell ,\ell '\in \varLambda _{\vert \ell _1\vert }. \end{aligned}$$
    (19)

Proof

1. Using the above representation of the spherical harmonics for \(\xi \) and \(\zeta \), we find

$$\begin{aligned} K\left( \xi , \zeta \right) =\sum _{\ell _1=-\infty }^{\infty } \sum _{\ell ,\ell ' \in \varLambda _{\vert \ell _1\vert }}c_{\ell _1} \left( \ell ,\ell '\right) e^{i\ell _1\theta _1-i\ell _1\nu _1} p_{\ell _1,\ell }\left( \theta '\right) p_{\ell _1,\ell '}\left( \nu '\right) . \end{aligned}$$

If we denote by \(\xi '\) the point on \({{\mathbb {S}}^{d-1}}\) with polar coordinates \((\nu _1,\theta _2,\ldots ,\theta _{d-1})\) and by \(\zeta '\) the point with polar coordinates \((\theta _1, \nu _2,\ldots ,\nu _{d-1})\) then a kernel is longitudinal reversible if \(K(\xi ,\zeta )=K(\xi ',\zeta ')\) for all \(\xi ,\zeta \in {{\mathbb {S}}^{d-1}}\). It follows as above

$$\begin{aligned} K\left( \xi ', \zeta '\right)&=\sum _{\ell _1=-\infty }^{\infty } \sum _{\ell ,\ell ' \in \varLambda _{\vert \ell _1\vert }}c_{\ell _1} \left( \ell ,\ell '\right) e^{i(-\ell _1)(\theta _1-\nu _1)} p_{\ell _1,\ell }\left( \theta '\right) p_{\ell _1,\ell '}\left( \nu '\right) \\&=\sum _{\ell _1=-\infty }^{\infty }\sum _{\ell ,\ell ' \in \varLambda _{\vert \ell _1\vert }}c_{-\ell _1} \left( \ell ,\ell '\right) e^{i\ell _1\theta _1-i\ell _1\nu _1} (-1)^{2\ell _1} p_{\ell _1,\ell }\left( \theta '\right) p_{\ell _1,\ell '} \left( \nu '\right) \\&= \sum _{\ell _1=-\infty }^{\infty }\sum _{\ell ,\ell ' \in \varLambda _{\vert \ell _1\vert }}c_{-\ell _1} \left( \ell ,\ell '\right) Y_{\ell _1,\ell }\left( \xi \right) \overline{Y_{\ell _1,\ell '}\left( \zeta \right) }. \end{aligned}$$

The result is a consequence of the uniqueness of the series representation.

2. The sufficiency of the second part follows directly from the representation. Since

$$\begin{aligned} K\left( \xi , \zeta \right)&=\sum _{\ell _1=1}^{\infty } \sum _{\ell ,\ell ' \in \varLambda _{\vert \ell _1\vert }} \left( e^{i\ell _1(\theta _1-\nu _1)}c_{\ell _1} \left( \ell ,\ell '\right) +(-1)^{2\ell _1}e^{-i\ell _1 (\theta _1-\nu _1)}c_{-\ell _1}\left( \ell ,\ell '\right) \right) \\ {}&\quad \times p_{\ell _1,\ell }\left( \theta '\right) p_{\ell _1,\ell '} \left( \nu '\right) + \sum _{\ell ,\ell ' \in \varLambda _{0}}c_0 \left( \ell ,\ell '\right) p_{0,\ell }\left( \theta '\right) p_{0,\ell '}\left( \nu '\right) \end{aligned}$$

shows that if all \(c_{\ell _1}\) are constant zero for \(\ell _1\ne 0\) the kernel is independent of the values \(\theta _1\), \(\nu _1\). For the necessity we use that

$$\begin{aligned} K\left( \xi , \zeta \right) =\sum _{\ell _1=-\infty }^{\infty } e^{i\ell _1\left( \theta _1-\varphi _1\right) } \sum _{\ell ,\ell ' \in \varLambda _{\vert \ell _1\vert }}c_{\ell _1}\left( \ell ,\ell '\right) p_{\ell _1,\ell }\left( \theta '\right) p_{\ell _1,\ell '}\left( \varphi '\right) \end{aligned}$$

is constant as a function of \(\left( \theta _1 -\varphi _1\right) \in (-2\pi ,2\pi )\) if and only if the second sum is zero for all \(\ell _1\ne 0\) and \(\theta ',\varphi '\).

3. We use the same representation above for the kernels \(K\left( \xi ,\zeta \right) \) as well as for its complex conjugate:

$$\begin{aligned} \overline{K\left( \xi , \zeta \right) }&=\sum _{\ell _1=-\infty }^{\infty } e^{-i\ell _1\left( \theta _1-\nu _1\right) } \sum _{\ell ,\ell ' \in \varLambda _{\vert \ell _1\vert }}\overline{c_{\ell _1} \left( \ell ,\ell '\right) } p_{\ell _1,\ell }\left( \theta '\right) p_{\ell _1,\ell '}\left( \nu '\right) \\&=\sum _{\ell _1=-\infty }^{\infty } e^{i(-\ell _1)\left( \theta _1-\nu _1\right) } \sum _{\ell ,\ell ' \in \varLambda _{\vert \ell _1\vert }}\overline{c_{\ell _1} \left( \ell ,\ell '\right) } (-1)^{2\ell _1}p_{-\ell _1,\ell } \left( \theta '\right) p_{-\ell _1,\ell '}\left( \nu '\right) \end{aligned}$$

and deduce that because of the uniqueness of the expansion coefficients, the two are equal if and only if \(\overline{c_{\ell _1}\left( \ell ,\ell '\right) } =c_{-\ell _1}\left( \ell ,\ell '\right) \) for all \(\ell _1\in {\mathbb {Z}}\) and \(\ell ,\ell '\in \varLambda _{\vert \ell _1\vert }\). \(\square \)

We note that in distinction to the results for invariance under parity and convolutional form the given results on axially-symmetric kernels are basis dependent. For example using a real basis of the spherical harmonics would change condition (19) to the coefficient having to be real. We finally summarise the connections between the introduced properties, which follows directly from the definitions.

Lemma 6

We assume the kernels in this proposition to be Hermitian and to satisfy (7).

  1. 1.

    In the case \(d=2\) all axially symmetric kernels have convolutional form.

  2. 2.

    An axially symmetric kernel given in the form Eq. (17) has a convolutional form if and only if

    $$\begin{aligned} c_{\ell _1}(\ell _2,\ldots ,\ell _{d-1},\ell '_2,\ldots , \ell '_{d-1})=0 \end{aligned}$$

    for all \(\ell _{d-1}\ne \ell '_{d-1}\in \mathbb {N}_0.\)

3 (Strict) Positive Definiteness of Axially Symmetric Kernels

In [6] strictly positive definite axially symmetric kernels on the 2-sphere were described and sufficient conditions for strict positive definiteness stated. We add sufficient conditions for d-dimensional spheres and prove some additional necessary conditions. In this section we focus on kernels on spheres with \(d\ge 3\), the special case of the circle is discussed in Section 5.

From now on we will use property (7) also for axially symmetric and convolutional kernels. In this case the coefficients \(a_{j,j',k,k'}\) in (7) are derived using (16) and (14) for axially symmetric kernels and (9) in the convolutional case.

Theorem 2

Let K be a Hermitian, axially symmetric kernel with property (7) given in the form (17), with \(d\ge 3\). The kernel is positive definite if and only if the mapping \(c_{\ell _1}: \varLambda _{\vert \ell _1\vert }\times \varLambda _{\vert \ell _1\vert } \rightarrow {\mathbb {C}}\) is positive definite for all \(\ell _1 \in {\mathbb {Z}}\).

Proof

For the first direction we rewrite the quadratic form

$$\begin{aligned} \sum _{\xi ,\zeta \in \varXi }\lambda _{\xi } \overline{\lambda _{\zeta }} K \left( \xi , \zeta \right) =\sum _{\ell _1=-\infty }^{\infty } \sum _{\ell ,{\ell '}\in \varLambda _{\vert \ell _1\vert }}c_{\ell _1} \left( \ell ,\ell '\right) \sum _{\xi \in \varXi }\lambda _{\xi }Y_{\ell _1,\ell } \left( \xi \right) \overline{\sum _{\zeta \in \varXi }\lambda _{\zeta } Y_{\ell _1,\ell '}\left( \zeta \right) } . \end{aligned}$$

Now we define \(\underline{\ell }=(\ell _2,\ldots ,\ell _{d-2}),\ y^{\ell _1,\underline{\ell },\ell _{d-1}}_{\varXi }:=\sum _{\xi \in \varXi }\lambda _{\xi } Y_{\ell _1,\underline{\ell },\ell _{d-1}} \left( \xi \right) \in {\mathbb {C}}\) to find

$$\begin{aligned} \sum _{\xi ,\zeta \in \varXi }\lambda _{\xi } \overline{\lambda _{\zeta }} K\left( \xi , \zeta \right)&= \sum _{\ell _1=-\infty }^{\infty } \underset{k \rightarrow \infty }{\lim } \sum _{\ell _{d-1}, {\ell '}_{d-1}=\vert \ell _1\vert }^{k} \ \sum _{\underline{\ell }\in \varLambda _{\ell _1,\ell _{d-1}}} \ \sum _{\underline{\ell '}\in \varLambda _{\ell _1,{\ell '}_{d-1}}}\\&\quad y^{\ell _1,\underline{\ell },\ell _{d-1}}_{\varXi } c_{\ell _1}\left( \left( \underline{\ell },\ell _{d-1}\right) , \left( \underline{\ell '},{\ell '}_{d-1}\right) \right) \overline{y^{\ell _1,\underline{\ell }',{\ell '}_{d-1}}_{\varXi }}, \end{aligned}$$

where

$$\begin{aligned} \varLambda _{\ell _1,\ell _{d-1}}:=\left\{ \left( \ell _2,\ldots , \ell _{d-2}\right) \in {\mathbb {N}}^{d-3}_0 \big \vert \, \vert \ell _1\vert \le \ell _2 \le \cdots \le \ell _{d-2}\le \ell _{d-1} \right\} \end{aligned}$$

and the absolute summability ensures equality after reordering. The positive definiteness of the mapping \(c_{\ell _1}\) implies that

$$\begin{aligned} \sum _{\ell _{d-1},\ell '_{d-1}=\vert \ell _1\vert }^{k} \ \sum _{\underline{\ell }\in \varLambda _{\ell _1,\ell _{d-1}}} \ \sum _{\underline{\ell '}\in \varLambda _{\ell _1,{\ell '}_{d-1}}} d_{\underline{\ell },\ell _{d-1}} c_{\ell _1} \left( \left( \underline{\ell },\ell _{d-1}\right) , \left( \underline{\ell '},{\ell '}_{d-1}\right) \right) \overline{d_{\underline{\ell '},{\ell '}_{d-1}}}\ge 0, \end{aligned}$$

for all \( \ell _1\in {\mathbb {Z}}\), \(k \ge \vert \ell _1\vert \) and arbitrary values \(d_{\underline{\ell },\ell _{d-1}}\in {\mathbb {C}}\). Thereby the limit in the penultimate equation is non-negative for all \(\ell _1\) and thereby the infinite sum is as well.

The necessity of the positive definiteness of the \(c_{\ell _1}\) is proven by contradiction. If we assume there exists an \(\ell _1 \in {\mathbb {Z}}\), a finite set \(\varLambda \subset \varLambda _{\ell _1}\) and coefficients \(d_{\ell }\in {\mathbb {C}}\) for \(\ell \in \varLambda \) satisfying

$$\begin{aligned} \sum _{\ell ,\ell '\in \varLambda }d_{\ell }c_{\ell _1}(\ell ,\ell ') \overline{d_{\ell '}}<0 \end{aligned}$$

then we can define \(f(\xi )=\sum _{\ell \in \varLambda }d_{\ell } Y_{\ell _1,\ell }(\xi )\in C({{\mathbb {S}}^{d-1}})\) and deduce

$$\begin{aligned} \int _{{{\mathbb {S}}^{d-1}}}\int _{{{\mathbb {S}}^{d-1}}}f(\xi )K(\xi ,\zeta ) \overline{f}(\zeta )\,d\xi \,d\zeta <0. \end{aligned}$$

This contradicts the positive definiteness of K since the equivalence of positive definiteness and integral positive definiteness has been established for general Riemannian manifolds in Theorem 2.1 in [9]. \(\square \)

We do not need the property of positive definiteness for arbitrary values of the index set in the proof of the result. Therefore we can loosen the assumption by introducing a matrix-like notation for the mapping. We first introduce a new index \(\alpha \) where \(\alpha \in {\mathbb {N}}\) and there is a one-to one mapping from \({\mathbb {N}}\) to the pairs

$$\begin{aligned} \left( \underline{\ell }_{\alpha },\ell _{d-1,\alpha }\right) \in \varLambda _{\vert \ell _1\vert } \end{aligned}$$

satisfying \(\ell _{d-1,\alpha '}\le \ell _{d-1,\alpha }\) if \(\alpha ' <\alpha \). For the proof, the actual order of the mapping is insignificant. We additionally define for each \(\alpha \) the eigenvalue corresponding to \(Y_{\ell _1, \underline{\ell }_{\alpha }, \ell _{d-1,\alpha }}\) by \(\lambda _{\alpha }\) which only depends on the value of \(\ell _{d-1,\alpha }\). Further we introduce the notation \(N_{\alpha }:=N_{\ell _{d-1,\alpha },d}\) for the number of spherical harmonics corresponding to the eigenvalue \(\lambda _{\alpha }\).

Then we define for each \(k\in {\mathbb {N}}\):

$$\begin{aligned} A_{\ell _1}^k=\left( c_{\ell _1}\left( \underline{\ell }_{\alpha }, \ell _{d-1,\alpha },\underline{\ell }_{\alpha '}, \ell _{d-1,\alpha '}\right) \right) _{\alpha ,\alpha '=1}^k. \end{aligned}$$
(20)

We note that in the case of \(d=3\) the matrix structure of the mappings \(c_{\ell _1}\) is evident since \(\varLambda _{\vert \ell _1\vert } =\{ j\in {\mathbb {N}}_0 \ \vert \ j\ge \vert \ell _1\vert \}\).

Proposition 1

Let K be a Hermitian, axially symmetric kernel of the form (17), \(d\ge 3\), with property (7) and \(A_{\ell _1}^k\) as above. The kernel is positive definite if and only if the matrix \(A_{\ell _1}^k\) is positive semi-definite for all \(\ell _1\in {\mathbb {Z}}\) and all \(k\in {\mathbb {N}}\).

The proof is similar to the proof of Theorem 2 and therefore omitted.

For the investigation of strict positive definiteness of axially symmetric kernels we present two approaches. For the first we continue to use the matrix representation introduced in the last proposition and for the second we will be using the mappings \(c_{\ell _1}\) and sets of indices \(\ell _1\in {\mathbb {Z}}\) which also allow to deduce strict positive definiteness.

Lemma 7

For K as in the above proposition. The kernel is strictly positive definite if and only if for each \(\ell _1\in {\mathbb {N}}\) there exist a sequence with positive entries \(d_j^{\ell _1}>0\), \(j\in {\mathbb {N}}\), such that the matrix

$$\begin{aligned} \tilde{A}_{\ell _1}^k=D_k^{1/2}A_{\ell _1}^kD_k^{1/2}, \quad \text {with } D_k=\left( \delta _{i,j}d_j^{\ell _1}\right) _{i,j=1}^k \end{aligned}$$

satisfies

$$\begin{aligned} \lambda _{\min }\left( \tilde{A}^k_{\ell _1}\right) >\varepsilon _{\ell _1}, \end{aligned}$$

where \(\varepsilon _{\ell _1}>0\) is independent of k.

Proof

From the proof of the last lemma we deduce:

$$\begin{aligned} \sum _{\xi ,\zeta \in \varXi }\lambda _{\xi } \overline{\lambda _{\zeta }} K\left( \xi , \zeta \right)&= \sum _{\ell _1=-\infty }^{\infty } \underset{k \rightarrow \infty }{\lim } \left( \mathbf {y}^{\ell _1,k}\right) ^T A^k_{\ell _1}\overline{\mathbf {y}^{\ell _1,k}}\\&\quad \text {where } \mathbf {y}^{\ell _1,k} =\left( y^{\ell _1,\underline{\ell }_{\alpha }, \ell _{d-1,\alpha }}_{\varXi }\right) _{\alpha =1}^{k}. \end{aligned}$$

Now we employ the definition of \(\tilde{A}^k_{\ell _1}\) in order to find

$$\begin{aligned} \sum _{\xi ,\zeta \in \varXi }\lambda _{\xi } \overline{\lambda _{\zeta }} K\left( \xi , \zeta \right)&=\sum _{\ell _1=-\infty }^{\infty } \underset{k \rightarrow \infty }{\lim } \left( \mathbf {y}^{\ell _1,k}\right) ^T D_k^{-1/2} \tilde{A}^k_{\ell _1}D_k^{-1/2}\overline{\mathbf {y}^{\ell _1,k}}\\ {}&\ge \sum _{\ell _1=-\infty }^{\infty }\underset{k \rightarrow \infty }{\lim }\Vert D_k^{-1/2}\mathbf {y}^{\ell _1,k}\Vert _2 ^2 \lambda _{\min } \left( \tilde{A}^k_{\ell _1}\right) \\ {}&> \sum _{\ell _1=-\infty }^{\infty }\underset{k \rightarrow \infty }{\lim } \varepsilon _{\ell _1}\left( \left\| D_k^{-1/2} \mathbf {y}^{\ell _1,k}\right\| _2\right) ^2, \end{aligned}$$

where in the second line we use that \(\tilde{A}^{k}_{\ell _1}\) is positive definite because its smallest eigenvalue is strictly positive.

Thereby

$$\begin{aligned} \sum _{\xi \in \varXi } \sum _{\zeta \in \varXi }\lambda _{\xi } \overline{\lambda _{\zeta }} K\left( \xi , \zeta \right)&> \sum _{\ell _1=-\infty }^{\infty }\varepsilon _{\ell _1} \underset{k \rightarrow \infty }{\lim } \left\| \left( D_{k}^{-1/2} \mathbf {y}^{\ell _1,k}\right) \right\| _2^2\\ {}&\ge \sum _{\ell _1=-\infty }^{\infty }\varepsilon _{\ell _1} \sum _{\alpha =1}^{\infty }\left( \left( d_{\alpha }^{\ell _1}\right) ^{-1/2} \left| y_{\varXi }^{\ell _1,\underline{\ell }_{\alpha }, \ell _{d-1,\alpha }}\right| ^2\right) , \end{aligned}$$

where the last sum can only be zero if \(y_{\varXi }^{\ell _1, \underline{\ell }_{\alpha },\ell _{d-1,\alpha }}\) is a zero sequence for all \(\ell _1\). This implies that the \(\lambda _{\xi }\) in the definition of the quadratic form need all be zero as a consequence of the linear independence of the spherical harmonics. \(\square \)

We give a simple example of kernels for which strict positive definiteness can be proven using the above result. Let the kernel take the form:

$$\begin{aligned} K(\xi ,\zeta )=\sum _{\ell _1=-\infty }^{\infty }\sum _{\ell \in \varLambda _{\vert \ell _1\vert }}c_{\ell _1}\left( \ell ,\ell \right) Y_{\ell _1,\ell }\left( \xi \right) \overline{Y_{\ell _1,\ell } \left( \zeta \right) }, \end{aligned}$$

where \(c_{\ell _1}\left( \ell ,\ell \right) >0\) and satisfies condition (7). Then \(A^k_{\ell _1}\) is a diagonal matrix with positive entries. Setting \(D_k=(\delta _{i,j}c_{\ell _1} (i,i))_{i,j=1}^{\infty }\) results in \(\tilde{A}_{\ell _1}^k=I\) and thereby the resulting kernel will be strictly positive definite. The described kernels are not isotropic but have convolutional form and therefore we will be able to give a more general condition in the next section.

Theorem 3

Let K be a Hermitian, axially symmetric kernel of the form (17) satisfying (7) for absolute summability and \(d>2\). The kernel is strictly positive definite if

$$\begin{aligned} \tilde{A}_{\ell _1}=\left( \tilde{a}^{\ell _1}_{\alpha ,\alpha '} \right) _{\alpha ,\alpha '=1}^{\infty }=\left( \sqrt{N_{\alpha }} c_{\ell _1}\left( \underline{\ell }_{\alpha },\ell _{d-1,\alpha }, \underline{\ell }_{\alpha '},\ell _{d-1,\alpha '}\right) \sqrt{N_{\alpha '}}\right) _{\alpha ,\alpha '=1}^{\infty } \end{aligned}$$

is positive definite on \(\ell ^{\infty }({\mathbb {N}})\) for all \(\ell _1\in {\mathbb {N}}\) and satisfies the uniform strict diagonal dominance property:

$$\begin{aligned} \sum _{\alpha '\ne \alpha } \vert \tilde{a}^{\ell _1}_{\alpha ,\alpha '} \vert < \sigma _{\ell _1} \vert \tilde{a}^{\ell _1}_{\alpha ,\alpha } \vert ,\quad \forall \alpha \in {\mathbb {N}}, \end{aligned}$$

where each \(0< \sigma _{\ell _1} <1\) is independent of \(\alpha \).

Proof

We denote by \(\tilde{A}_{\ell _1}=G_{\ell _1}G_{\ell _1}^*\) the infinite Cholesky decomposition matrix which exists since \(\tilde{A}_{\ell _1}\) is positive definite and bounded on \(\ell ^{\infty }({\mathbb {N}})\) as a consequence of (7).

Inserting the Cholesky-decomposition into the quadratic form

$$\begin{aligned} \sum _{\xi \in \varXi } \sum _{\zeta \in \varXi }\lambda _{\xi } \overline{\lambda _{\zeta }} K\left( \xi , \zeta \right) =\sum _{\ell _1=-\infty }^{\infty }\left\| G_{\ell _1} \left( \tilde{\mathbf {y}}^{\ell _1}\right) \right\| _2^2, \end{aligned}$$
(21)

where \(\tilde{\mathbf {y}}^{\ell _1} =\left( \frac{1}{\sqrt{N_{\alpha }}}y_{\varXi }^{\ell _1, \underline{\ell }_{\alpha },\ell _{d-1,\alpha }}\right) _{\alpha =1}^{\infty }\) is a bounded sequence as a result of the estimate of the spherical harmonics in (5). We first deduce that if there exists an element of the space of bounded sequences \(\ell ^{\infty }({\mathbb {N}})\) for which \(G_{\ell _1}^*x=0\), then \(G_{\ell _1}\left( G_{\ell _1}^*x\right) =\tilde{A}_{\ell _1}x=0\), where we can exchange the order of multiplication because of the triangular structure of \(G_{\ell _1}\) and because \(G_{\ell _1}^*x\) is elementwise finite for all bounded x. This is a result of \(\left\| G^*_{\ell _1}x \right\| ^2 =\overline{x}^T\tilde{A}_{\ell _1}x <\infty \) which follows from (7).

Thereby it is sufficient to show that there exists no eigenvector in the space of bounded sequences for which \(\tilde{A}_{\ell _1}x=0.\) We prove this by contradiction, adapting the conditions of [27] Theorem 1b.

From the positive definiteness of the sub-matrices of \(\tilde{A}^{\ell _1}\) we can deduce that \(\tilde{a}^{\ell _1}_{\alpha ,\alpha }\) is non-zero for all \(\alpha \in {\mathbb {N}}\). Thereby

$$\begin{aligned} \sum _{\alpha '=0}^{\infty } \tilde{a}^{\ell _1}_{\alpha ,\alpha '}x_{\alpha '}&= 0,\quad \forall \alpha \in {\mathbb {N}}_{0},\\ \Leftrightarrow \ \sum _{\alpha '\ne \alpha } \tilde{a}^{\ell _1}_{\alpha ,\alpha '}x_{\alpha '}&= -\tilde{a}_{\alpha ,\alpha '}x_{\alpha }, \quad \forall \alpha \in {\mathbb {N}}_{0}. \end{aligned}$$

Since x has \(\ell ^{\infty }\)-norm one, there exists a value \(\alpha \in {\mathbb {N}}_{0}\), for which \(\vert x_\alpha \vert >\sigma _{\ell _1}\) and

$$\begin{aligned} \sum _{\alpha '\ne \alpha } \vert \tilde{a}^{\ell _1}_{\alpha ,\alpha '}\vert \ge \sum _{\alpha '\ne \alpha } \vert \tilde{a}^{\ell _1}_{\alpha ,\alpha '} x_{\alpha '} \vert \ge \vert \tilde{a}_{\alpha ,\alpha }\vert \sigma _{\ell _1}. \end{aligned}$$

This contradicts the assumption and the only possible choice for which (21) is equal to zero is \(\tilde{\mathbf{y }}^{\ell _1}=0\) for all \(\ell _1\in {\mathbb {Z}}\). From the definition of this sequence we know that this is only possible if \(\lambda _{\xi }=0\) for all \(\xi \in \varXi \) as in the proof of the last lemma. \(\square \)

Examples for kernels for which this result is applicable are all kernels where a certain ordering of the coefficients leads to \(A_{\ell _1}\) having the form of a general band matrix.

We wish to find more general sufficient conditions that are easy to evaluate. They will be stronger than simply demanding that \(c_{\ell _1}\) be strictly positive definite, which would only allow a finite subset of the indices \(\varLambda _{\ell _1}\) to be used in a quadratic form.

Therefore we define the set \({\mathcal {F}}\) as the set of all indices \(\ell _1 \in {\mathbb {Z}}\) for which

$$\begin{aligned} \text {there exists}\ j \in \varLambda _{\vert \ell _1\vert }, d_j\ne 0 \quad \Rightarrow \quad \sum _{j,j' \in \varLambda _{\vert \ell _1\vert }}d_jc_{\ell _1} \left( j,j'\right) \overline{d_{j'}} \end{aligned}$$

does not converge to zero.

Lemma 8

A Hermitian axially symmetric kernel K of the form (17) , satisfying (7), with positive definite maps \(c_{\ell _1}:\varLambda _{\vert \ell _1\vert } \times \varLambda _{\vert \ell _1\vert }\rightarrow {\mathbb {C}}\) is strictly positive definite if

$$\begin{aligned} \sum _{\xi \in \varXi }\lambda _{\xi } Y_{\ell _1,\ell } \left( \xi \right) =0\ \forall \ell _1\in {\mathcal {F}}, \ \ell \in \varLambda _{\vert \ell _1\vert } \quad \Rightarrow \quad \lambda _{\xi }=0\ \forall \xi \in \varXi . \end{aligned}$$

Proof

We prove the result by contradiction and assume that K is positive definite but not strictly positive definite. If K is not strictly positive definite there exists a nonempty set of distinct points \(\varXi \) and coefficients \(\lambda _{\xi }\) not all zero with

$$\begin{aligned} \sum _{\xi ,\zeta \in \varXi }\lambda _{\xi } \overline{\lambda _{\zeta }} K\left( \xi , \zeta \right) =0. \end{aligned}$$

This is equivalent according to the computation in the proof of Theorem 2 to

$$\begin{aligned} \sum _{\ell _1=-\infty }^{\infty }\sum _{\ell ,\ell ' \in \varLambda _{\vert \ell _1\vert }}c_{\ell _1} \left( \ell ,\ell '\right) Y_{\ell _1,\ell ,\varXi } \overline{Y_{\ell _1,\ell ',\varXi }}=0, \end{aligned}$$
(22)

where \(Y_{\ell _1,\ell ,\varXi }=\sum _{\xi \in \varXi } \lambda _{\xi } Y_{\ell _1,\ell }\left( \xi \right) \).

Since we know that all the sums are non-negative because the maps \(c_{\ell _1}\) are all positive definite, the overall sum can only be zero if all summands are. For the indices \(\ell _1\in {\mathcal {F}}\) this implies \(Y_{\ell _1,\ell ,\varXi }=0\), because the convergence of the sum is ensured by property (7). But according to the condition of the lemma this implies \(\lambda _\xi =0\) for all \(\xi \in \varXi \), in contradiction to the assumption of the proof. \(\square \)

With the goal of understanding which sets \({\mathcal {F}}\) lead to strictly positive definite kernels we first state a simple sufficient condition. The following proposition is an immediate consequence of the last lemma together with the linear independence of the spherical harmonics.

Proposition 2

A kernel as in in the last lemma is strictly positive definite if \({\mathcal {F}}={\mathbb {Z}}\).

For the special case \(d=3\), this result was proven in [6] as Theorem 2, where the assumptions on the maps can be simplified because of the specific structure of \({\mathbb {S}}^2\). As a next step we study necessary conditions:

Lemma 9

Let K be a Hermitian, axially symmetric kernel of the form (17), satisfying (7). For the kernel to be strictly positive definite it is necessary that the mapping

$$\begin{aligned} c_{\ell _1}: \varLambda _{\vert \ell _1\vert } \times \varLambda _{\vert \ell _1 \vert } \rightarrow {\mathbb {C}}\end{aligned}$$

is not identically zero for infinitely many \(\ell _1 \in {\mathbb {Z}}\).

Proof

The result is proven by contradiction. Assume there are only finitely many values of \(\ell _1\) for which \(c_{\ell _1}\) is not the zero mapping. Let us denote the set of these \(\ell _1\) by \({\mathcal {J}}\). Then we can construct a set of points \(\varXi \) and coefficients \(\lambda _{\xi }\in {\mathbb {R}}\) such that

$$\begin{aligned} \sum _{\xi \in \varXi }\lambda _{\xi }Y_{\ell _1,\ell } \left( \xi \right) =0,\quad \forall \ell _1\in {\mathcal {J}}, \ \ell \in \varLambda _{\vert \ell _1\vert }. \end{aligned}$$
(23)

These are infinitely many conditions since \(\varLambda _{\vert \ell _1\vert }\) includes infinitely many elements but if we choose all points in \(\varXi \) of the form \(\left( \theta _{1,k}, \theta _{2}, \ldots ,\theta _{d-1}\right) \) for \(k=0,\ldots ,\vert {\mathcal {J}}\vert \), the above equations can be transformed into

$$\begin{aligned} \frac{1}{\sqrt{2\pi }}\prod _{j=2}^{d-1}{}_{j} \tilde{P}_{\ell _j}^{\ell _{j-1}}\left( \theta _{j}\right) \sum _{k=0}^{\vert {\mathcal {J}}\vert }\lambda _{k} e^{i\ell _1\theta _{1,k}} =0,\quad \forall \ell _1\in {\mathcal {J}}, \ \ell \in \varLambda _{\vert \ell _1\vert }, \end{aligned}$$
(24)

which is satisfied if

$$\begin{aligned} \sum _{k=0}^{\vert {\mathcal {J}}\vert }\lambda _{k} e^{i\ell _1\theta _{1,k}} =0,\quad \forall \ell _1\in {\mathcal {J}}. \end{aligned}$$
(25)

These are only \(\vert {\mathcal {J}}\vert \) conditions which can be satisfied for our choice of \(\theta _{1,k}\) at \(\vert {\mathcal {J}} \vert +1\) distinct points. \(\square \)

The next result is especially interesting because for spherical radial basis functions we were used to be able to leave out finitely many indices of similar sets without loosing strict positive definiteness. This is not the case for axially symmetric kernels, where \(\ell _1=0\) is of special importance.

Lemma 10

Let K be a Hermitian, axially symmetric kernel of the form (17) satisfying (7). For the kernel to be strictly positive definite, it is necessary that the mapping

$$\begin{aligned} c_{0}: \varLambda _{\vert \ell _1 \vert } \times \varLambda _{\vert \ell _1\vert } \rightarrow {\mathbb {C}}\end{aligned}$$

is not identically zero.

Proof

We show that for any point on the surface of the sphere \(\xi \) which has polar coordinates \(\left( \theta _1,0,\theta _3,\ldots , \theta _{d-1}\right) \), the spherical basis functions \(Y_{\ell _1,\ell }\left( \xi \right) \) only take non-zero values if \(\ell _1=0\). Inserting the point into the definition of the spherical harmonics from (12)

$$\begin{aligned} Y_{\ell _1,\ell }\left( \xi \right)&=\frac{1}{\sqrt{2\pi }} e^{i\ell _1\theta _1} \prod _{j=2}^{d-1}{}_{j} \tilde{P}_{\ell _j}^{\ell _{j-1}}\left( \theta _{j}\right) \\ {}&=\frac{1}{\sqrt{2\pi }}e^{i\ell _1\theta _1} \left( \frac{2\ell _2+1}{2}\frac{\left( \ell _2+\ell _1\right) !}{\left( \ell _2-\ell _1\right) !}\right) ^{1/2} P_{\ell _2}^{-\ell _1}\left( 1\right) \prod _{j=3}^{d-1}{}_{j} \tilde{P}_{\ell _j}^{\ell _{j-1}}\left( \theta _{j}\right) . \end{aligned}$$

Since the last part of the product is finite, the value is non-zero only if \(P_{\ell _2}^{-\ell _1}\left( 1\right) \) is non-zero and this is only the case for \(\ell _1=0\). \(\square \)

Two ways to construct axially symmetric kernels on the two-sphere for which the described conditions can be applied are described in [2] where the definition is via sequences and [11] via integration of isotopic kernels over arches.

4 (Strict) Positive Definiteness of Convolutional Kernels or Kernels with Eigenvalue Block Structure

We now assume that K is continuous Hermitian and has the form

$$\begin{aligned} K\left( \xi , \zeta \right) =\sum _{j=0}^{\infty }\sum _{k=1}^{N_{j,d}} \sum _{k'=1}^{N_{j,d}} d_j\left( k,k'\right) Y_j^k\left( \xi \right) \overline{Y_{j}^{k'}\left( \zeta \right) },\qquad \forall \xi ,\zeta \in {{\mathbb {S}}^{d-1}}, \end{aligned}$$
(26)

for a fixed set of orthogonal basis functions, where \(Y_{j}^{k}\) is an eigenfunction corresponding to the eigenvalue \(\lambda _j\) of Sect. 1.1. We note that under this assumption there exists another orthonormal basis of the eigenfunctions corresponding to each eigenvalue such that, if we express K with respect to this sequence (which is also the Hilbert-Schmidt basis of the kernel), it takes the form (10). Further we emphasize that in this section we use continuity as a criterion instead of property (7), because it is the weaker assumption on the kernel.

These kernels were studied under the name convolutional kernels for example in [21, 22]. The fixed basis case on the 2-sphere was also studied in [7] and we now generalise the results.

It is proven in [10] that a continuous and Hermitian kernel K of the form (10) is positive definite if and only if the \(d_{j,k}\) are real and non-negative for all \(j \in {\mathbb {Z}}_+\), \(k\in \left\{ 1,\ldots ,N_{j,d}\right\} \), (their Theorem 2.1).

We can now transfer the result known for radial kernels which was first proven in [8], to this kernel class. Since positive definiteness is independent of the choice of the expansion basis, the proofs will be carried out using kernels of the form (10) and we later briefly note the implications for fixed basis.

Lemma 11

If a continuous and Hermitian kernel K of the form (10) is strictly positive definite, all \(d_{j,k}\) are non-negative and the sequences \(\left( d_{j,k} \right) _{k=1}^{N_{j,d}}\) are not identically zero for infinitely many even and infinitely many odd values of \(j\in {\mathbb {Z}}_+\).

Proof

We assume K is continuous, Hermitian and strictly positive definite. Since strict positive definiteness implies positive definiteness, the \(d_{j,k}\) are all non-negative according to [10, Theorem 2]. The rest of the proof is divided into four cases. Let us denote the set of indices for which \(\left( d_{j,k}\right) _{k=1}^{N_{j,d}}\) are not all zero with

$$\begin{aligned} {\mathcal {J}}:=\left\{ j \in {\mathbb {Z}}_+ \big \vert \left( d_{j,1}, \ldots ,d_{j,N_{j,d}}\right) ^T \ne \mathbf {0}^{N_{j,d}}\right\} . \end{aligned}$$

For all four cases we assume that K is strictly positive definite of the form (10) and prove that assuming either

  1. 1.

    \({\mathcal {J}} \subset 2{\mathbb {N}}\) or

  2. 2.

    \({\mathcal {J}}\subset 2{\mathbb {N}}+1\) or

  3. 3.

    \(1\le \vert {\mathcal {J}}\cap 2{\mathbb {N}}\vert < \infty \) or

  4. 4.

    \(1\le \vert {\mathcal {J}}\cap \left( 2{\mathbb {N}}+1\right) \vert < \infty \),

leads to a contradiction.

  1. 1.

    Let us now assume \(\left( d_{j,k}\right) _{k=1}^{N_{j,d}}\) are not all zero only for even values of j. We can represent the quadratic form as

    $$\begin{aligned} \sum _{\xi ,\zeta \in \varXi }\lambda _{\xi } \overline{\lambda _{\zeta }} K \left( \xi , \zeta \right)&=\sum _{j=0}^{\infty }\sum _{k=1}^{N_{j,d}} d_{j,k}\sum _{\xi \in \varXi }\lambda _{\xi } \tilde{Y}_j^k \left( \xi \right) \overline{\sum _{\zeta \in \varXi }\lambda _{\zeta } \tilde{Y}_{j}^{k}\left( \zeta \right) } \\&=\sum _{j=0}^{\infty }\sum _{k=1}^{N_{j,d}} y^{j,k}_{\varXi } d_{j,k} \overline{y^{j,k}_{\varXi }},\quad \text { with } y^{j,k}_{\varXi } =\sum _{\xi \in \varXi }\lambda _{\xi } \tilde{Y}_j^k\left( \xi \right) \in {\mathbb {C}}. \end{aligned}$$

    Choosing a non-empty set of data sites \(\varXi '\) which satisfies

    $$\begin{aligned} \xi \in \varXi '\ \Rightarrow \ -\xi \in \varXi ', \end{aligned}$$

    where \(-\xi \) is the antipodal point of \(\xi \), and setting \(\lambda _{\xi }=-\lambda _{-\xi }\), we find that \(y_{\varXi '}^{j,k}=0\) for all even j since \(\tilde{Y}_j^k\left( -\xi \right) =\left( -1\right) ^j\tilde{Y}_j^k\left( \xi \right) \) follows from the \(\tilde{Y}_j^k\) all being homogeneous polynomials of order j. This implies \(\sum _{\xi ,\zeta \in \varXi '}\lambda _{\xi } \overline{\lambda _{\zeta }} K\left( \xi , \zeta \right) =0\) and therefore K would not be strictly positive definite.

  2. 2.

    The same argument applies when we assume \(\left( d_{j,k}\right) _{k=1}^{N_{j,d}}\) are not all zero only for odd values of j. The same set \(\varXi '\) can be chosen but now \(\lambda _{\xi }=\lambda _{-\xi }\) yields the contradiction to the assumption of strict positive definiteness.

  3. 3.

    Now we assume that \(\left( d_{j,k}\right) _{k=1}^{N_{j,d}}\) are not all zero for any number of odd values of j and only finitely many even values of j. Set \(\hat{\jmath }\) to the maximal even index for which \(\left( d_{j,k} \right) _{k=1}^{N_{j,d}}\) is not zero. We aim to construct a set \(\varXi \) with elements only in the lower hemisphere of \({{\mathbb {S}}^{d-1}}\) and \(\lambda \in {\mathbb {C}}^{\vert \varXi \vert }\ne 0\), s.t. \(\sum _{\xi \in \varXi }\lambda _{\xi } \tilde{Y}_j^k\left( \xi \right) =0\) for all even \(j \le \hat{\jmath }\). These are

    $$\begin{aligned} M:=\sum _{m=1}^{\hat{\jmath }/2}N_{2m,d}<\infty \end{aligned}$$

    linear equations. We can therefore choose any set of distinct points in the lower hemisphere with more than M elements to find a non-trivial solution. Defining the set \(\varXi '=\varXi \cup \left( -\varXi \right) \) and setting \(\lambda _{\xi }=\lambda _{-\xi }\) shows that

    $$\begin{aligned} \sum _{\xi ,\zeta \in \varXi '}\lambda _{\xi } \overline{\lambda _{\zeta }} K\left( \xi , \zeta \right) =0 \end{aligned}$$

    for a non-trivial vector \(\lambda \) and therefore K is not strictly positive definite.

  4. 4.

    The same arguments can be used to show that \(\left( d_{j,k}\right) _{k=1}^{N_{j,d}}\) needs to be non-zero for infinitely many odd values of j.

\(\square \)

For the fixed basis form this implies that for a strictly positive definite kernel of the form (26) it is necessary that the matrices \(D_j:=\left( d_j\left( k,k'\right) \right) _{k,k' =1}^{N_{j,d}}\) are all positive definite and infinitely many with odd j and infinitely many with even j are not the zero matrix.

With the aim of finding sufficient conditions that are easy to evaluate we define the set \({\mathcal {F}}\) as the set of all indices \(j \in {\mathbb {Z}}_+\) for which \(D_j\) is a positive definite matrix, or if the appropriate basis is used, all \(d_{j,k}>0\) for all \(k=1,\ldots ,N_{j,d}\).

Lemma 12

A continuous and Hermitian kernel K of the form (26) which is positive definite is strictly positive definite if

$$\begin{aligned} \sum _{\xi \in \varXi }\lambda _{\xi } Y_j^k\left( \xi \right) =0, \ \forall \ k=1, \ldots ,N_{j,d},\ j\in {\mathcal {F}} \quad \Rightarrow \quad \lambda _{\xi }=0, \forall \xi \in \varXi . \end{aligned}$$

The proof follows in the same line of argument as Lemma 8.

With Lemma 12, we have shown that strict positive definiteness of these kernels can be proven using the same results which were used for zonal kernels by Chen et al. in [8] and the alternative proof for more general manifolds stated by Barbosa and Menegatto in [4]. The next lemma and Theorem 4, generalise the results to the convolutional case and complex kernels.

Lemma 13

For a given set \({\mathcal {F}}\) the following two properties are equivalent:

  1. 1.

    \(\sum _{\xi \in \varXi }\lambda _{\xi } Y_j^k\left( \xi \right) =0,\ \forall \ k=1, \ldots ,N_{j,d},\ j\in {\mathcal {F}}\) implies \(\lambda _{\xi }=0,\ \forall \xi \in \varXi .\)

  2. 2.

    \(\sum _{\xi \in \varXi }\lambda _{\xi } P^d_{j}\left( \xi ^T\zeta \right) =0,\ \forall j \in {\mathcal {F}},\, \forall \zeta \in {{\mathbb {S}}^{d-1}}\) implies \(\lambda _{\xi }=0,\ \forall \xi \in \varXi \).

Here \(P_j^{d}\) are the Legendre polynomials of degree j in d dimensions.

Proof

Using the addition formula of the spherical harmonics we find

$$\begin{aligned} \frac{N_{j,d}}{\sigma _{d-1}}P_{j}^d\left( \xi ^T\zeta \right) =\sum _{k=1}^{N_{j,d}} Y_{j}^k\left( \xi \right) \overline{Y_{j}^k \left( \zeta \right) .} \end{aligned}$$

Therefore

$$\begin{aligned} \frac{N_{j,d}}{\sigma _{d-1}} \sum _{\xi \in \varXi } \lambda _{\xi } P_{j}^d\left( \xi ^T\zeta \right) =\sum _{k=1}^{N_{j,d}} \left( \sum _{\xi \in \varXi } \lambda _{\xi }Y_{j}^k\left( \xi \right) \right) \overline{Y_{j}^k\left( \zeta \right) }, \qquad \forall \zeta \in {{\mathbb {S}}^{d-1}}. \end{aligned}$$

Since the spherical harmonics are linearly independent, the last expression is zero if and only if

$$\begin{aligned} \sum _{\xi \in \varXi }\lambda _{\xi }Y_{j}^k\left( \xi \right) =0 \end{aligned}$$

for all \(k=1,\ldots ,N_{j,d}\), and all \(j\in {\mathcal {F}}\). \(\square \)

Theorem 4

Let K be a continuous positive definite kernel of the form (26), \(d>2\), and \({\mathcal {F}}\) the corresponding index set for which the \(D_j\) are positive definite matrices. Then it is sufficient for K to be strictly positive definite that \({\mathcal {F}}\) includes infinitely many even and infinitely many odd values of \(j\in {\mathbb {Z}}_+\).

Proof

Combining Lemmas 12 and 13 we know that for K to be strictly positive definite it is sufficient to prove that for arbitrary sets of distinct data sites \(\varXi \subset {{\mathbb {S}}^{d-1}}\) the functions \(P_j^d\left( \xi ^T\zeta \right) \) satisfy Lemma 13 (2). Therefore we show that for any such set \(\varXi \)

$$\begin{aligned} \sum _{\xi \in \varXi }\lambda _{\xi }P_j^d\left( \xi ^T\zeta \right) =0, \qquad \forall j \in {\mathcal {F}},\ \zeta \in {{\mathbb {S}}^{d-1}}, \end{aligned}$$

implies \(\lambda _{\xi }=0\) for all \(\xi \in \varXi \). We do this by choosing for each \(\xi \in \varXi \) a corresponding \(\zeta =\zeta _{\xi }\in {{\mathbb {S}}^{d-1}}\) and show that for this choice \(\lambda _{\xi }=0\) if the above holds. Assume we have fixed \(\xi \in \varXi \), we need to distinguish two cases.

Case 1 \( \xi ^T\zeta \ne -1\) for all \(\zeta \in \varXi .\)

In this case we choose \(\zeta _{\xi }=\xi \) and the system above takes the form

$$\begin{aligned} \lambda _{\xi } P_j^d\left( 1\right) +\sum _{\zeta \in \varXi \setminus \{ \xi \}}\lambda _{\zeta }P_j^d\left( \zeta ^T\xi \right) =0, \qquad \forall j \in {\mathcal {F}}. \end{aligned}$$

Note that \(\xi ^T\zeta \in \left( -1,1\right) \). Since there are infinitely many indices in \({\mathcal {F}}\) we can choose a sequence from \({\mathcal {F}}\) with \(j_n\in {\mathcal {F}}\) and \(\underset{n \rightarrow \infty }{\lim } j_n=\infty \). Introducing the limit in the above equation and using the asymptotic estimate for Legendre polynomials in [3] (2.117) we get

$$\begin{aligned} \lambda _{\xi } + \underset{n \rightarrow \infty }{\lim } \frac{\sum _{\zeta \in \varXi \setminus \{ \xi \}}\lambda _{\zeta } P_j^d\left( \xi ^T\zeta \right) }{P_j^{d}\left( 1\right) }=0. \end{aligned}$$

This implies \(\lambda _{\xi }=0\) because the sum is finite and the latter part converges to zero.

Case 2 There is one \(\zeta \in \varXi \) with \(\xi ^T \zeta =-1\).

Then \(\zeta \) is the antipodal point of \(\xi \) and therefore in Cartesian coordinates \(\xi =-\zeta \). Then the above equation becomes

$$\begin{aligned} \lambda _{\xi } P_j^d\left( 1\right) +\left( -1\right) ^j \lambda _{-\xi } P_j^d\left( 1\right) +\sum _{\zeta \in \varXi \setminus \{ \xi , -\xi \}}\lambda _{\zeta }P_j^d \left( \zeta ^T\xi \right) =0, \qquad \forall j \in {\mathcal {F}}. \end{aligned}$$

The third part vanishes if we introduce the limit as in the previous case but for odd and even series of \(j_n\) separately. The remainder yields for even j and odd j, respectively,

$$\begin{aligned} \lambda _{\xi }+\lambda _{-\xi }=0= \lambda _{\xi }-\lambda _{-\xi } \end{aligned}$$

which implies \(\lambda _{\xi }=0=\lambda _{-\xi }\). \(\square \)

5 The Special Case of Kernels on the Circle

The case \(d=2\) is of special interest for us since the results of Theorem 4 do not apply to this case and since according to Lemma 6 (1) all axially symmetric kernels on the circle have convolutional form. For \(d=2\) we note that a basis of the spherical harmonics is

$$\begin{aligned} Y_{j}\left( \xi \right) =\frac{1}{\sqrt{2\pi }}e^{ij\theta },\quad j\in {\mathbb {Z}}, \end{aligned}$$

where \(\theta \) is associated with the point on the circle \(\xi =\left( \cos \left( \theta \right) ,\sin \left( \theta \right) \right) ^T \in {\mathbb {S}}^1\). The Legendre polynomials take the form of Chebyshev polynomials and the addition formula reads

$$\begin{aligned} Y_j(\xi )\overline{Y}_j(\zeta )+ Y_{-j}(\xi )\overline{Y}_{-j}(\zeta ) =\frac{2}{\pi }\cos \left( j (\theta -\nu )\right) , \end{aligned}$$

where \(\theta \) is corresponding to \(\xi \) as above and \(\zeta =\left( \cos \left( \nu \right) ,\sin \left( \nu \right) \right) ^T\). Using the circular harmonics every continuous kernel on \({\mathbb {S}}^1\times {\mathbb {S}}^1\) has an expansion of the form

$$\begin{aligned} K(\xi ,\zeta )=\sum _{j,j'\in {\mathbb {Z}}}a_{j,j'}e^{ij\theta }e^{-ij'\nu }. \end{aligned}$$

For a Hermitian convolutional kernel only the coefficients corresponding to the same order of trigonometric polynomial are going to be non-zero. This yields

$$\begin{aligned} K(\xi ,\zeta )&=\sum _{j\in {\mathbb {Z}}_+}\left( d_{j,j}e^{ij(\theta -\nu )} +d_{-j,-j}e^{-ij(\theta -\nu )}+d_{-j,j}e^{-ij(\theta +\nu )} +\overline{d_{-j,j}}e^{ij(\theta +\nu )}\right) \\&\quad +d_{0,0}, \end{aligned}$$

where we can define the matrices of the last section as \( D_j=\begin{pmatrix} d_{j,j} &{}{} d_{j,-j} \\ &{} \\ \overline{d_{j,-j}} &{}{}d_{-j,-j} \end{pmatrix}\).

Theorem 5

Let K be a continuous positive definite kernel of the form (26), \(d=2\), and \({\mathcal {F}}\) be the corresponding index set for which the \(D_j\) are positive definite matrices. Then it is sufficient for K to be strictly positive definite that the set \(\{n\in {\mathbb {Z}}\mid |n|\in {\mathcal {F}}\}\) intersects every full arithmetic progression in \({\mathbb {Z}}_+\).

Proof

For this special case Lemma 13 (2) reads

$$\begin{aligned} \sum _{\xi \in \varXi }\lambda _{\xi }\cos \left( j\arccos (\xi ^T\zeta )\right) =0, \quad \forall j \in {\mathcal {F}},\ \forall \xi \in {\mathbb {S}}^1, \end{aligned}$$

implies \(\lambda _{\xi }=0\) for all \(\xi \in \varXi \). This is the case if all functions of the form \(\sum _{j\in {\mathcal {F}}}a_j\cos \left( j\arccos (\xi ^T\zeta )\right) \) with \(a_j>0\) are positive definite and thereby we can deduce our result directly from Corollary 2.3 and Theorem 2.15 of [20]. \(\square \)

Theorem 6

Let K be a continuous positive definite kernel of the form (26), \(d=2\) and \({\mathcal {J}}\) the corresponding index set for which the \(D_j\) are non-zero. Then it is necessary for K to be strictly positive definite that \({\mathcal {J}}\) intersects every full arithmetic progression in \({\mathbb {Z}}_+\).

Proof

The proof again uses contradiction and the construction of sets which show that Lemma 13 (2) does not hold if \({\mathcal {J}}\) does not intersect every full arithmetic progression. As in the previous cases this construction is the same which was used in the proof for isotropic kernels and we therefore refer to the proof of [4] Theorem 5.2, where details of the construction are given. \(\square \)

We briefly mention implications for kernel is axially symmetric kernels on \({\mathbb {S}}^1\). According to Theorem 1, a kernel is axially symmetric if

$$\begin{aligned} a_{j,j'}=\delta _{j,j'}c_j. \end{aligned}$$

Thereby a Hermitian axially symmetric kernel takes the form

$$\begin{aligned} K(\xi ,\zeta )=\sum _{j\in {\mathbb {Z}}_+}\left( c_{j}e^{ij(\theta -\nu )} +c_{-j}e^{-ij(\theta -\nu )}\right) +c_{0}. \end{aligned}$$

Then Theorems 5 and 6 directly apply to the axially symmetric kernels, where \({\mathcal {F}}\) is the set of \(j\in {\mathbb {Z}}\) for which both \(c_j,c_{-j}>0\) and \({\mathcal {J}}\) is the set of indices j for which \(c_j\) or \(c_{-j}\) are non-zero.

6 Discussion

The discussion in Section 2 allows to construct kernels with different geometric properties. Even though these kernels could only be constructed approximately as truncated series we believe that they can nevertheless be used in applications as discussed in [2] for axially symmetric kernels. We also believe that taking into account a wider range of geometric properties of the kernel will allow the improvement of approximation results for many areas of application. Unfortunately any truncation would remove strict positive definiteness. This problem can be addressed in applications for example by choosing the truncation index high enough so that the remaining error is close to machine precision.

The more specific results in Section 3 show that stating necessary and sufficient conditions for axially symmetric strictly positive definite kernels is possible in a similar way to the results known for radial kernels and that in fact one can utilize the results known for isotropic kernels to prove the more general case, as done in Section 4.