Abstract
We present some new results concerning Lebesgue-type inequalities for the Weak Chebyshev Greedy Algorithm (WCGA) in uniformly smooth Banach spaces \({{\mathbb {X}}}\). First, we generalize Temlyakov’s theorem (Temlyakov in Forum Math Sigma 2(12):26, 2014) to cover situations in which the modulus of smoothness and the \({\texttt {A3}}\) parameter are not necessarily power functions. Secondly, we apply this new theorem to the Zygmund spaces \({{\mathbb {X}}}=L^p(\log L)^{\alpha }\), with \(1<p<\infty \) and \({\alpha }\in {{\mathbb {R}}}\), and show that, when the Haar system is used, then exact recovery of N-sparse signals occurs when the number of iterations is \(\phi (N)=O(N^{\max \{1,2/p'\}} \,(\log N)^{|{\alpha }| p'})\). Moreover, this quantity is sharp when \(p\le 2\). Finally, an expression for \(\phi (N)\) in the case of the trigonometric system is also given.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
1 Introduction
In this paper we consider several theoretical aspects regarding N-term approximation in a Banach space \(({{\mathbb {X}}},\Vert \cdot \Vert )\), over a field \({{\mathbb {K}}}={{\mathbb {R}}}\) or \({{\mathbb {C}}}\).
A fundamental question in this topic is, given a dictionary \({{\mathcal {D}}}=\{{\varphi }_i\}_{i\in {{\mathcal {I}}}}\) in \({{\mathbb {X}}}\), and the corresponding set of N-sparse vectors
then find constructive procedures (algorithms) \({\mathscr {A}}_N:{{\mathbb {X}}}\rightarrow \Sigma _N\), where for all \(f\in {{\mathbb {X}}}\) the quantity \(\Vert f-{\mathscr {A}}_N(f)\Vert \) is as close as possible to the best error of N-term approximation, defined by
Once an algorithm \({\mathscr {A}}_N\) is fixed, one can quantify the above statement by considering the associated Lebesgue-type inequality, which amounts to find the smallest value of \(\phi (N)\) so that
with C a fixed universal constant (if it exists). Observe, in particular, that (1.1) guarantees exact recovery of all N-sparse signals after \(\phi (N)\) iterations, that is
Ideally, one would like to find algorithms \({\mathscr {A}}_N\) so that (1.1) holds with \(\phi (N)=N\) (and \(C=1\)). But this is hardly possible in many situations (a notable exception being when \({{\mathcal {D}}}\) is an orthonormal basis in a Hilbert space). For instance, in the classical case when \({{\mathcal {D}}}\) is the trigonometric system in \(L^p({{\mathbb {T}}})\), \(p\not =2\), it is still a relevant open question to find one such (constructive) algorithm.
In this paper we shall be interested in the Weak Chebyshev Greedy Algorithm (WCGA), which was introduced by Temlyakov in [12] as a generalization to Banach spaces of the celebrated Orthogonal Matching Pursuit (OMP) from Hilbert spaces. We refer to [13, 15, 16], and references therein, for background on this topic.
Lebesgue-type inequalities for the WCGA were proved in [7, 14]; see also [16, Chapter 8] for a historical overview. One the features of WCGA is that it has good approximation properties for the trigonometric system in \(L^p\). Indeed, it was shown in [14, (4.3)] that, if \(p>2\), then Lebesgue inequalities hold with only \(\phi (N)=O(N\log N)\) iterations. This seems to be the best known result with a constructive algorithm in that setting. Likewise, for the univariate Haar system in \(L^p\), if \(1<p\le 2\), then it suffices with \(\phi (N)=O(N)\) iterations; see [14, (4.7)].
The above results are special cases of a deep theorem proved by Temlyakov in [14, Theorem 2.8], which we describe in detail below. In that theorem, the number of iterations \(\phi (N)\) is estimated in terms of some intrinsic properties of the pair \(({{\mathbb {X}}},{{\mathcal {D}}})\), namely, the power type of the modulus of smoothness of \(({{\mathbb {X}}},\Vert \cdot \Vert )\), and the power function associated with the so-called property \({\texttt {A3}}\) of \({{\mathcal {D}}}\); see (1.16) below.
Our main result in this paper, Theorem 1.12, will be a generalization of Temlyakov’s theorem, which allows to cover situations in which the modulus of smoothness and the \({\texttt {A3}}\) parameters are not necessarily power functions. This is actually needed in some special cases, such as when \({{\mathbb {X}}}=L^p(\log L)^{\alpha }\), for which additional log factors appear naturally. Our next results, Theorems 1.17 and 5.20, will be applications of Theorem 1.12 to this setting, for two special dictionaries, the Haar and the trigonometric system.
We next give a more detailed description of these results.
1.1 Statements of Results
We assume that \(({{\mathbb {X}}},\Vert \cdot \Vert )\) is a uniformly smooth Banach space, meaning that its modulus of smoothness
satisfies \(\rho _{{\mathbb {X}}}(t)=o(t)\) as \(t\rightarrow 0\). Given \(f\in {{\mathbb {X}}}\) with \(f\not =0\), we let \(F_f\in {{\mathbb {X}}}^*\) be the associated norming functional, that is, the (unique) element in \({{\mathbb {X}}}^*\) such that
Uniqueness follows from the smoothness of the norm \(\Vert \cdot \Vert \).
We say that \({{\mathcal {D}}}=\{{\varphi }_i\}_{i\in {{\mathcal {I}}}}\) is a dictionary in \({{\mathbb {X}}}\), if it consists of non-null vectors whose closed linear span is \({{\mathbb {X}}}\), that is
We do not assume the dictionary elements to be normalized, although as a consequence of later properties \({{\mathcal {D}}}\) will be semi-normalized, that is
for some constants \(\mathfrak {c_1}\ge \mathfrak {c_0}>0\); see Sect. 2.1 below.
Definition 1.2
(Weak Chebyshev Greedy Algorithm (WCGA)) Given a fixed \(\tau \in (0,1]\), a \(\tau \)-WCGA associated with \(({{\mathbb {X}}},\Vert \cdot \Vert ,{{\mathcal {D}}})\) is any collection of mappings
with the following properties:
Given \(f\in {{\mathbb {X}}}\setminus \{0\}\), we let \(f_0:=f\) and define inductively vectors \({\varphi }_{i_1},\ldots , {\varphi }_{i_n}\) in \({{\mathcal {D}}}\) and \(f_1,\ldots , f_n\in {{\mathbb {X}}}\) by the following procedure: at step \(n+1\) we pick any \({\varphi _{i_{n+1}}}\in {{\mathcal {D}}}\) such that
and let \({\mathscr {G}}_{n+1}(f)\) be any element in \([{\varphi }_{i_1},\ldots ,{\varphi }_{i_{n+1}}]\) such that
Then we set \(f_{n+1}=f-{\mathscr {G}}_{n+1}(f)\), and iterate the process (indefinitely, or until the remainder \(f_{n+1}=0\)).
If at some stage we have \(f_n=0\), then we just let \({\mathscr {G}}_{n+k}(f)={\mathscr {G}}_n(f)=f\) for all \(k\ge 1\).
Remark 1.4
Note that such algorithms can always be constructed when \(\tau <1\), and for some dictionaries also when \(\tau =1\) (namely, when the sup in (1.3) is attained within \({{\mathcal {D}}}\)).
We next define the three key properties that are needed to prove Lebesgue-type inequalities for WCGA. The first one is a generalization of a property given in [2, Definition 1.13].
Definition 1.5
Let Q(t) be a positive increasing function for \(t\in (0,\infty )\), with \(Q(0)=0\). We say that \(({{\mathbb {X}}},\Vert \cdot \Vert ,{{\mathcal {D}}})\) satisfies property \({\texttt {D}}(Q)\) if
The next definition coincides with property \({\texttt {A2}}\) from [7, 14].
Definition 1.7
Let \(N<D\) be positive integers and \(k_N>0\). We say that \(\Sigma _N({{\mathcal {D}}})\in {\texttt {A2}}(k_N, D)\) if
If the above holds for all \(D<\infty \), we just write \(\Sigma _N({{\mathcal {D}}})\in {\texttt {A2}}(k_N)\).
Our third definition is a slight generalization of property \({\texttt {A3}}\) from [14].
Definition 1.9
Let \(N<D\) be positive integers and let \(\{H(k)\}_{k=1}^\infty \) be an increasing sequence of positive numbers. We say \(\Sigma _N({{\mathcal {D}}})\in {\texttt {A3}}(H,D)\) if
If the above holds for all \(D<\infty \), we just write \(\Sigma _N({{\mathcal {D}}})\in {\texttt {A3}}(H)\).
Finally, we recall that a positive sequence \(\{G(k)\}_{k=1}^\infty \) is called 1-quasi-convex if
As an example, if G(t) is a positive convex function in \((0,\infty )\) with \(G(0^+)=0\), then \(\{G(k)\}_{k=1}^\infty \) is 1-quasi-convex. This is the case, for instance, for the functions
if \(p=1\) and \({\alpha }\ge 0\), or if \(p>1\) and \({\alpha }\in {{\mathbb {R}}}\) (for a sufficiently large \(c\ge e\)).
The precise statement of our main result is now the following.
Theorem 1.12
Let \(({{\mathbb {X}}},\Vert \cdot \Vert )\) be a Banach space, \({{\mathcal {D}}}\) a dictionary, \(\tau \in (0,1]\) and \({\mathscr {G}}_n:{{\mathbb {X}}}\rightarrow \Sigma _n\) a \(\tau \)-WCGA. Let \(D>N\ge 1\) be fixed. Let \(k_N>0\) and let Q(t), H(n) be positive and increasing functions such that the following properties hold
-
(i)
\(({{\mathbb {X}}},\Vert \cdot \Vert ,{{\mathcal {D}}})\) satisfies \({\texttt {D}}(Q)\)
-
(ii)
\(\Sigma _N\) satisfies property \({\texttt {A2}}(k_N,D)\).
-
(iii)
\(\Sigma _N\) satisfies property \({\texttt {A3}}(H,D)\).
Let \({\lambda }_1>1\). Assume further that the sequence
is 1-quasi-convex. If we let
then it holds
provided that \(N+\phi (N)< D\).
We now make some comments about this theorem.
-
(a)
The result obtained by Temlyakov in [14, Theorem 2.8] corresponds to the case when \({\lambda }_1\) is a (possibly large) universal constant, and
$$\begin{aligned} H(N)=V_N\,N^r {\quad \text{ and }\quad }Q(t)=c\,t^{q'}, \end{aligned}$$where \(q>1\) is the power type of the modulus of smoothness, i.e. \(\rho _{{\mathbb {X}}}(t)=O(t^q)\). In that case, the required number of iterations becomes
$$\begin{aligned} \phi (N)\,=\, C_1\,(V_N/\tau )^{q'}\,\log (1+k_N)\,N^{rq'}, \end{aligned}$$(1.16)for some \(C_1>0\), provided that \(rq'\ge 1\). Our contribution gives an additional explicit form for the constants when the parameter \({\lambda }_1\) approaches 1.
-
(b)
As we show in Proposition 2.3 below, if \({{\mathcal {D}}}\) is normalized, then condition \({\texttt {D}}(Q)\) always holds with
$$\begin{aligned} Q(t)=2{\delta }_{{{\mathbb {X}}}^*}(t/2), \end{aligned}$$where \({\delta }_{{{\mathbb {X}}}^*}(t)\) is the modulus of convexity of the dual space \({{\mathbb {X}}}^*\). This is also a new result. In many practical cases the asymptotic behavior of \({\delta }_{{{\mathbb {X}}}^*}(t)\) is well-known, so one can use property \({\texttt {D}}(Q)\) with no need to compute \(\rho _{{\mathbb {X}}}(t)\).
-
(c)
As was discussed in [2, Remark 2.10], in some special cases it is possible to prove that \(({{\mathbb {X}}},\Vert \cdot \Vert ,{{\mathcal {D}}})\) satisfies property \({\texttt {D}}(Q)\) with a function Q(t) which is considerably better than \({\delta }_{{{\mathbb {X}}}^*}(t)\) (for t near 0). For instance, if \({{\mathbb {X}}}=\ell ^p\) and \({{\mathcal {D}}}\) is the canonical basis, then one can take \(Q(t)=c_pt^{p'}\), which gives better results than \({\delta }_{{{\mathbb {X}}}^*}(t)=O(t^{\max \{p',2\}})\) when \(p>2\). Other examples (with power type) were given in [2, Proposition 4.12 and Lemma 5.7].
-
(d)
The assumption that G(n) in (1.13) is 1-quasi-convex is only made for convenience. Alternatively, one could replace G(n) by any convex majorant (hence, 1-quasi-convex). In practice, quasi-convexity is easily verified after substituting the functions Q(t) and H(n) into (1.13); see the example in (1.11).
-
(e)
As in [14], the conclusion (1.15) in the previous theorem also holds when the assumptions \({\texttt {A2}}\) and \({\texttt {A3}}\) are required only on the individual sparse element \(\Phi =\sum _{j\in T}x_j{\varphi }_j\), with \(|T|\le N\) (and not necessarily in all \(\Phi \in \Sigma _N\)). Namely, in this case the requirement would be that (1.8) and (1.10) must hold for all \(A\subset T\) and all scalars \(a_j\in {{\mathbb {K}}}\) such that \(a_j=x_j\), \(j\in A\).
Our second result is an application of Theorem 1.12 to the case when \({{\mathbb {X}}}=L^p(\log L)^{\alpha }\); see Sect. 4 below for the precise definition. We stress that, when \(1<p\le 2\), the number of iterations which are derived from the above theorem, namely
is actually (asymptotically) optimal for all \({\alpha }\in {{\mathbb {R}}}\).
Theorem 1.17
Let \(1<p<\infty \) and \({\alpha }\in {{\mathbb {R}}}\), and let \({{\mathbb {X}}}=L^p(\log L)^{\alpha }\) be as in Sect. 4. Let \({{\mathcal {D}}}\) be the (normalized) Haar basis in \({{\mathbb {X}}}\). Then
-
(a)
there exists a constant \(C>1\) such that the WCGA satisfies
$$\begin{aligned} \Big \Vert f-{\mathscr {G}}_{\phi (N)}(f)\Big \Vert \le \,2\,\sigma _N(f),\quad \forall \,f\in {{\mathbb {X}}},\;N\in {{\mathbb {N}}}, \end{aligned}$$(1.18)where
$$\begin{aligned} \phi (N)=\left\{ \begin{array}{ll} C\,N^\frac{2}{p'}\,\big (\log (e+N)\big )^{2{\alpha }_+} &{} \text{ when }\ p>2\\ C\,N\,\big (\log (e+N)\big )^{p'|{\alpha }|} &{} \text{ when }\ 1<p\le 2. \end{array}\right. \end{aligned}$$(1.19) -
(b)
if for some sequence \(\psi (N)\) the WCGA satisfies
$$\begin{aligned} \Big \Vert f-{\mathscr {G}}_{\psi (N)}(f)\Big \Vert \le \,C\,\sigma _N(f),\quad \forall \,f\in {{\mathbb {X}}},\;N\in {{\mathbb {N}}}, \end{aligned}$$(1.20)then necessarily \(\psi (N)\ge c'\, N\,\big (\log (e+N)\big )^{|{\alpha }|p'}\), for some \(c'>0\).
Remark 1.21
We remark that, when \(p>2\), it is an open question already for \(L^p\) spaces (case \({\alpha }=0\)) whether \(\phi (N)\approx N^{2/p'}\) iterations are necessary to ensure (1.18); see [16, Open Problem 8.3, p. 448].
Finally, in Sect. 5 we give a similar application in the case that \({{\mathbb {X}}}=L^p(\log L)^{\alpha }\) and \({{\mathcal {D}}}=\{e^{inx}\}_{n\in {{\mathbb {Z}}}}\) is the trigonometric system. See Theorem 5.20 below for details.
2 Preliminaries
2.1 About Seminormalization of \({{\mathcal {D}}}\)
We claim that the two properties \({\texttt {D}}(Q)\) and \({\texttt {A3}}(H,D)\) imply that the dictionary \({{\mathcal {D}}}\) must be semi-normalized. Indeed, if \(\Sigma _1\) satisfies \({\texttt {A3}}(H,D)\) then
On the other hand, \({\texttt {D}}(Q)\) implies that \(Q\big (|F_f({\varphi })|\big )\le 1\) for all \({\varphi }\in {{\mathcal {D}}}\) and \(f\in {{\mathbb {X}}}{\setminus }\{0\}\). Setting \(f={\varphi }\) and using \(F_{\varphi }({\varphi })=\Vert {\varphi }\Vert \), this gives
Conversely, suppose that \({{\mathcal {D}}}=\{{\varphi }_j\}\) is a dictionary satisfying any of the properties \({\texttt {D}}(Q)\), \({\texttt {A2}}(k_N,D)\) or \({\texttt {A3}}(H,D)\), and let \({\tilde{\varphi }}_j={{\lambda }_j}{\varphi }_j\) for scalars \({\lambda }_j\) such that
It is then easily seen that the new dictionary \({\widetilde{{{\mathcal {D}}}}}=\{{\tilde{\varphi }}_j\}\) satisfies the corresponding properties with new parameters, namely
We also remark that if \({\mathscr {G}}_N\) is \(\tau \)-WCGA for \({{\mathcal {D}}}\), then it is also a \((\tau \mathfrak {c_0}/\mathfrak {c_1})\)-WCGA for \(\widetilde{{{\mathcal {D}}}}\).
2.2 About Condition \({\texttt {D}}(Q)\)
We give a practical criterion which ensures that condition \({\texttt {D}}(Q)\) holds. Let \(({{\mathbb {X}}},\Vert \cdot \Vert )\) be a Banach space with modulus of smoothness
We denote by \({\delta }(s)={\delta }_{{{\mathbb {X}}}}(s)\) its modulus of convexity, that is
Next, we consider the following related function, introduced by Figiel [3],
Assume for simplicity that \({{\mathbb {X}}}\) is uniformly smooth, that is \(\rho _{{\mathbb {X}}}(t)=o(t)\) when \(t\rightarrow 0\) (so in particular, \({{\mathbb {X}}}\) is reflexive). Then, it is easily seen that \(Q(s)={{\widetilde{{\delta }}}}_{{{\mathbb {X}}}^*}(s)\) is a convex increasing function with \(Q(0)=0\). Moreover, it is shown in [3, Proposition 1] (see also [6, Proposition 1.e.6]) that \({{\widetilde{{\delta }}}}_{{{\mathbb {X}}}^*}(s)\) is “equivalent” to \({\delta }_{{{\mathbb {X}}}^*}(s)\) (for small s), in the sense that
Also, \({{\widetilde{{\delta }}}}_{{{\mathbb {X}}}^*}(s)\) is the greatest convex minorant of \({\delta }_{{{\mathbb {X}}}^*}(s)\). In particular, \({{\widetilde{{\delta }}}}_{{{\mathbb {X}}}^*}={\delta }_{{{\mathbb {X}}}^*}\) when the later is a convex function. In many examples of Banach spaces \({{\mathbb {X}}}\), the behavior of the function \({\delta }_{{{\mathbb {X}}}^*}(s)\) is well-known (sometimes quite explicitly). For instance, if \({{\mathbb {X}}}=L^p\), \(1<p<\infty \), then
see [6, p.63]. Our main result in this section is the following.
Proposition 2.3
If \(({{\mathbb {X}}},\Vert \cdot \Vert )\) is uniformly smooth, then every normalized dictionary \({{\mathcal {D}}}\) in \({{\mathbb {X}}}\) satisfies property \({\texttt {D}}(Q)\) with \(Q(s)=2{{\widetilde{{\delta }}}}_{{{\mathbb {X}}}^*}(s)\).
Proof
It suffices to prove (1.6) for \(f=x\in {{\mathbb {X}}}\) with \(\Vert x\Vert =1\). Let \(F_x\in {{\mathbb {X}}}^*\) be the norming functional of \({{\mathbb {X}}}\), and given \({\varphi }\in {{\mathcal {D}}}\), let \(\nu =\overline{\text{ sign }}\,F_x({\varphi })\). Then, for every \(t\ge 0\), using [2, Proposition 2.1], we have
Taking the infimum over all \(t\ge 0\) we obtain
\(\square \)
Remark 2.4
If the dictionary is not normalized, but we assume that \(0<\Vert {\varphi }\Vert \le \mathfrak {c_1}\), for all \({\varphi }\in {{\mathcal {D}}}\), then the previous result gives
So property \({\texttt {D}}(Q)\) holds with \(Q(s)=2\,{{\widetilde{{\delta }}}}_{{{\mathbb {X}}}^*}(s/\mathfrak {c_1})\), which is also a function equivalent to \({\delta }_{{{\mathbb {X}}}^*}(s)\).
2.3 About Condition \({\texttt {A3}}(H)\)
In practice, it is quite common that \(({{\mathbb {X}}},{{\mathcal {D}}})\) satisfies properties \({\texttt {A2}}\) or \({\texttt {A3}}\) with depth \(D=\infty \). Our first observation is that this implies that \({{\mathcal {D}}}\) has a biorthogonal dual system.
Lemma 2.5
Let \({{\mathcal {D}}}=\{{\varphi }_j\}_{j=1}^\infty \) be a dictionary in \({{\mathbb {X}}}\). Assume that one of the following properties hold
-
(i)
There exists \(k_1>0\) such that \(\Sigma _1({{\mathcal {D}}})\in {\texttt {A2}}(k_1;D)\), for all \(D<\infty \)
-
(ii)
There exists \(H(1)>0\) such that \(\Sigma _1({{\mathcal {D}}})\in {\texttt {A3}}(H;D)\), for all \(D<\infty \).
Then, there exists \(\{{\varphi }^*_j\}_{j=1}^\infty \) in \({{\mathbb {X}}}^*\) such that \(\{{\varphi }_j,{\varphi }^*_j\}_{j=1}^\infty \) is a biorthogonal system, ie
Proof
This is a consequence of [11, Theorem 6.1, page 54]. Indeed, if (i) holds then
which implies biorthogonality by [11, Theorem 6.1, “\(8^\textrm{o}\Rightarrow 2^\textrm{o}\)”]. Similarly, if (ii) holds then
which implies biorthogonality by [11, Theorem 6.1, “\(3^\textrm{o}\Rightarrow 2^\textrm{o}\)”]. \(\square \)
So under this situation, the dictionary \({{\mathcal {D}}}\) generates a dual system \({{\mathcal {D}}}^*\). Then, a variation of [2, Lemma 2.17] gives the following.
Lemma 2.6
Let \({{\mathcal {D}}}=\{{\varphi }_j\}_{j=1}^\infty \) be a dictionary, with dual system \({{\mathcal {D}}}^*=\{{\varphi }^*_j\}_{j\ge 1}\). Then, \(\Sigma _N\in {\texttt {A3}}(H,D)\), for all \(N<D<\infty \), if we choose
Proof
Take sets \(A\subset B\), with \(|A|\le N\), and scalars \(a_j\in {{\mathbb {K}}}\). Let \({\varepsilon }_j=\mathop {\overline{\textrm{sign}}}a_j\), and denote
Then
\(\square \)
In practice, the sequence H(n) in (2.7) is equivalent to the fundamental function of \({{\mathcal {D}}}^*\) in \({{\mathbb {X}}}^*\), which in many examples has an explicit expression.
2.4 Quasi-convex Sequences
Given a positive sequence \(w=\{w(j)\}_{j=1}^\infty \) we define its associated summing sequence \(\widetilde{w}\) by
Lemma 2.9
If \(w=\{w(j)\}_{j=1}^\infty \) is non-decreasing then for all \(N\in {{\mathbb {N}}}\)
Proof
Let \({\Delta }_j=\{n\in {{\mathbb {N}}}\mid 2^j\le n<2^{j+1}\}\), which has cardinality \(2^j\), \(j=0,1,\ldots \) Then, if J is the largest integer with \(2^J\le N\) we have
\(\square \)
Lemma 2.10
If \(w=\{w(j)\}_{j=1}^\infty \) is 1-quasi-convex then
-
(a)
\(\widetilde{w}(N)\le w(N)\) for all \(N\in {{\mathbb {N}}}\)
-
(b)
\(\widetilde{w}\) is superadditive, that is,
Proof
The assertion a) follows from the definition of 1-quasi-convex, since
The assertion b) follows similarly from
\(\square \)
3 The Proof of Theorem 1.12
In this section we give the proof of Theorem 1.12. We shall follow the main steps in the original proof of Temlyakov, see [14, Theorem 2.8] or [16, Theorem 8.7.18], adapted to the new properties \({\texttt {D}}(Q)\) and \({\texttt {A3}}(H,D)\). For completeness, we give self-contained arguments of all the steps, although the main changes will mostly appear in steps 1 and 4.
3.1 Step 1. The Iiteration Theorem
The following result is a generalization of [2, Theorem 3.1], so we follow the notation presented there. Namely, if \(f\in {{\mathbb {X}}}\setminus \{0\}\), then we write
for the remainder and the supporting set of the n-th WCGA applied to f; see Definition 1.2. Also, if \(\Phi =\sum _{j\in T}a_j{\varphi }_j\in \Sigma _N\) and \(A\subset T\), then we denote
We shall also make frequent use of [2, Lemma 2.12], which asserts that
Theorem 3.2
Let \(D>N\ge 1\). Assume that
-
(i)
\(({{\mathbb {X}}},\Vert \cdot \Vert ,{{\mathcal {D}}})\) satisfies \({\texttt {D}}(Q)\)
-
(ii)
\(\Sigma _N\) satisfies property \({\texttt {A3}}(H,D)\)
Then, for every \(f\in {{\mathbb {X}}}\setminus \{0\}\), \(\Phi =\sum _{j\in T}a_j{\varphi }_j\in \Sigma _N\), and \({\lambda }>1\), and for all integers \(m,M\ge 0\) such that \(N+m+M<D\) the following holds
for all sets \(A\subset T_k\) (with \(A\not =\emptyset \)), \(B=T_k\setminus A\) and all \(k\in [0,m)\), and where
Proof
Given a fixed \(n\in [m,m+M)\), condition \({\texttt {D}}(Q)\) implies
By definition of the WCGA and (3.1), for each (non-empty) \(A\subset T\) we have
Now, the assumption \(\Sigma _N\in {\texttt {A3}}(H,D)\) implies that
In order to apply \({\texttt {A3}}\) we have used that \(|A\cap T_n|\le |T|\le N\) and \(|T\cup {\Gamma }_n|\le N+n\le N+m+M<D\). Thus, inserting these estimates into (3.5) we obtain
which is valid for all sets \(A\subset T\).
Fix now an integer \(k\in [0,m]\) and a set \(A\subset T_k\), and let \(B=T_k\setminus A\). Since \({\Gamma }_k\subset {\Gamma }_n\) we can use (3.1) to obtain
So, we conclude that
Using in the denominator that \(\Vert \Phi -f\Vert \le \Vert f_n\Vert \) (when the numerator is not zero), this further simplifies into
where we have let \(u=(\Vert f-\Phi \Vert +\Vert \Phi _B\Vert )/\Vert f_n\Vert \). Now, call
Observe from (2.1) and (2.2) that \(\beta <Q(1/H(1))\le 1\). Now, if \(u\le 1/{\lambda }\) then we have
On the other hand, if \(u\ge 1/{\lambda }\), by definition of u we have
and therefore,
So, combining (3.6) and (3.7), and calling \(v=\Vert f-\Phi \Vert +\Vert \Phi _B\Vert \) we obtain
Since \(\Vert f_{n+1}\Vert \le \Vert f_n\Vert \) (and \(\beta <1\)) this implies
We can now iterate for all \(n\in [m,m+M)\) to obtain
Finally, using the value of v and \(1-\beta \le e^{-\beta }\) we obtain
This corresponds exactly to (3.3). \(\square \)
3.2 Step 2: Selection of Sets \(A_j\)
In the next step, we shall follow [16, pp. 435–437], and iteratively apply Theorem 3.2, with a suitably chosen selection of sets \(A_j\), in order to obtain the following result. We have adapted the proof to include the new conditions \({\texttt {D}}(Q)\) and \({\texttt {A3}}(H, D)\), and have made more precise the value of the constants.
Theorem 3.8
Let \(({{\mathbb {X}}},\Vert \cdot \Vert ,{{\mathcal {D}}})\) and \(1\le N<D\) be such that
-
(i)
\(({{\mathbb {X}}},\Vert \cdot \Vert ,{{\mathcal {D}}})\) satisfies \({\texttt {D}}(Q)\)
-
(ii)
\(\Sigma _N\) satisfies property \({\texttt {A2}}(k_N,D)\).
-
(iii)
\(\Sigma _N\) satisfies property \({\texttt {A3}}(H,D)\).
Given \({\lambda }>1\) and \({\delta }>0\), there exists \(\beta _0=\beta _0({\lambda }, {\delta }, k_N)>0\) such that, if \(\beta \ge \beta _0\) and \(\Phi =\sum _{j\in T}a_j{\varphi }_j\in \Sigma _N{\setminus }\{0\}\), then there exist positive integers \(L, m_L\in {{\mathbb {N}}}\) such that
(with G(n) defined in (3.4)), and so that for all \(x\in {{\mathbb {X}}}\setminus \{0\}\) it holds
provided that \(N+m_L<D\). Moreover, we can set
Proof
Let \(n\ge 0\) be such that \(2^{n-1}<|T|\le 2^n\). Now, for each \(j=1,2,\ldots , n+1\), choose \(A_j\subset T\) such that
Then, define \(B_j=T\setminus A_j\). Picking the sets \(A_j\) with smallest cardinality we may assume that \(|A_1|\le |A_2|\le \cdots \le |A_{n+1}|\) (although these sets may not be nested). As special cases we define
This construction implies the following
This will be a crucial argument later to conclude the proof of the theorem.
Let \(\beta >0\) be a large number to be determined later, and define
For the moment assume that \(\beta \) is large enough so that \(\eta <1/2\). With that choice of \(\beta \) we have \(b>1\). Now, pick the first positive integer \(L=L(b, \Phi )\in {{\mathbb {N}}}\) such that
Note that we could have \(L=1\) if the first condition never holds, i.e. whenever \(\Vert \Phi \Vert =\Vert \Phi _{B_0}\Vert \ge b\,\Vert \Phi _{B_1}\Vert \). At the other extreme, we always have
which implies that \(1\le L\le n+1\). Thus,
which is the first assertion in (3.9). Observe also that \(A_L\not =\emptyset \), since otherwise we would have \(A_j=\emptyset \), for all \(j\le L\), and hence \(\Phi _{B_L}=\Phi _{B_{L-1}}=\Phi \), which would contradict the right hand side of (3.11).
We now apply iteratively Theorem 3.2. Consider the numbers \(m_0=0\) and
Actually, to avoid trivial cases, we should restrict to \(j=j_0,\ldots , L\), where \(j_0\) is the first integer such that \(|A_{j_0}|\not =0\) (and let \(m_j=0\) for \(j<j_0\)). We also assume that \(\beta \) is large enough so that
Observe that
since G is increasing. This is the second inequality in (3.9).
Now, for each \(j=j_0,\ldots , L\) we apply Theorem 3.2 with \(k=0\), \(m=m_{j-1}\), \(M=\lfloor \beta \,G(|A_j|)\rfloor \) and \(A=A_j\) to obtain
using in the last line that \(\lfloor a\rfloor \ge a/2\) if \(a\ge 1\). Observe that the above inequalities hold trivially for \(1\le j<j_0\) (if there is any such j) since
and in this case \(B_j=T\). Therefore, we can iterate the inequalities to obtain
For the first summand we also have
We now use the crucial assumption (3.11), that is,
which inserted into the above expression gives
using in the last step the choice of \(b=1/(2\eta )\).
On the other hand, we can use that \(\Sigma _N\) satisfies property \({\texttt {A2}}(k_N, D)\) to obtain the following estimate
At this point we wish to use the Key Fact in (3.10). So we distinguish two cases.
Case 1: \(\Vert \Phi -x\Vert < A\, \Vert \Phi _{B_{L-1}}\Vert \), for some \(A>0\) to be determined. Then
In this case, we wish to select A and \(\eta \) (and hence \(\beta \)) so that
which by the Key Fact would imply that
Case 2: \(\Vert \Phi _{B_{L-1}}\Vert \le A^{-1}\,\Vert \Phi -x\Vert \). In this case, using (3.13) we have
So, we wish to select A and \(\eta \) (hence \(\beta \)) such that
Overall, we have reduced the theorem to find numbers A and \(\eta \) so that (3.15) and (3.16) hold. Writing \(A=\eta \,B\), this amounts to find B and \(\eta \) so that
This is clearly possible if B is chosen sufficiently large and \(\eta \) sufficiently small. In order to make an explicit choice, we let \(B=8/{\delta }\), so we need to select \(\eta \) so that
If we impose the first condition, the second one will hold provided
That is, we can choose
with the last equality following easily from \(k_N\ge 1\) and \({\lambda }\ge 1\). So, simplifying a bit we can choose
and using that \(\eta =e^{-\beta /2}\), we find the expression
We finally observe that (3.12) is also satisfied, as in fact we have \(G(1)\ge 1\). This is a simple consequence of
with the second inequality due to (2.1) (for any \({\varphi }\in {{\mathcal {D}}}\)), and the last one due to \({\texttt {D}}(Q)\). \(\square \)
Remark 3.17
In order to ensure that \(\Vert x_{m_L}\Vert \le 2\Vert x-\Phi \Vert \) we must choose \({\lambda }\) and \({\delta }\) so that \((1+{\delta }){\lambda }=2\). For instance, \({\lambda }=\sqrt{2}\) and \({\delta }=\sqrt{2}-1\) will give the value
3.3 Step 3
The next step is a slight generalization of the previous Theorem 3.8 (which would be the special case \(k=0\)).
Theorem 3.18
Let \(({{\mathbb {X}}},\Vert \cdot \Vert ,{{\mathcal {D}}})\) and \(1\le N<D\) be such that
-
(i)
\(({{\mathbb {X}}},\Vert \cdot \Vert ,{{\mathcal {D}}})\) satisfies \({\texttt {D}}(Q)\)
-
(ii)
\(\Sigma _N\) satisfies property \({\texttt {A2}}(k_N,D)\).
-
(iii)
\(\Sigma _N\) satisfies property \({\texttt {A3}}(H,D)\).
Given \({\lambda }>1\) and \({\delta }>0\), let \(\beta _0=\beta _0({\lambda }, {\delta }, k_N)>0\) be as in Theorem 3.8, and let \(\beta \ge \beta _0\). If \(x\in {{\mathbb {X}}}\) and \(\Phi =\sum _{j\in T}a_j{\varphi }_j\in \Sigma _N\) are not null, and if \(k\in {{\mathbb {N}}}_0\) is such that
then there exist integers \(L\in {{\mathbb {N}}}\) and \(m_L\ge k+1\) such that
and so that
provided that \(N+m_L<D\).
Proof
Apply the construction in the first part of Theorem 3.8 to the vector \(\Phi _{T_k}\) (instead of \(\Phi \)). So for \(\eta \) and b fixed as above, this gives an integer \(L\in {{\mathbb {N}}}\) such that \(2^{L-2}<|T_k|\) and sets \(A_j\subset T_k\) and \(B_j=T_k{\setminus } A_j\) such that the inequalities in (3.11) hold.
At this point we let \(m_0=k\) and consider
where \(j_0\) is the first integer such that \(|A_{j_0}|\not =0\). Otherwise we let \(m_j=m_0=k\) when \(1\le j<j_0\). As before, this choice (and the size of the sets \(A_j\)) gives the second assertion in (3.19).
Now, if \(j_0\le j\le L\) we apply Theorem 3.2 with \(m=m_{j-1}\), \(M=\lfloor \beta G(|A_j|)\rfloor \) and \(A=A_j\) to obtain
When \(0\le j<j_0\), we have instead
since \(A_j=\emptyset \) and hence \(B_j=T_k\). Thus, we can proceed exactly as we did in (3.13) to obtain the same conclusion, namely
On the other hand, using property \({\texttt {A2}}(k_N, D)\) we obtain
which is the analogous inequality to (3.14) in the previous theorem.
At this point one considers the same two cases as in the lines following (3.14). Namely
Case 1: \(\Vert \Phi -x\Vert < A\, \Vert \Phi _{B_{L-1}}\Vert \), with the same \(A>0\) as in Theorem 3.8. This implies
so by the construction of the sets \((A_j,B_j)\) and the Key Fact one obtains
Case 2: \(\Vert \Phi _{B_{L-1}}\Vert \le A^{-1}\,\Vert \Phi -x\Vert \). In this case, the same reasoning as in Theorem 3.8 gives
This completes the proof of Theorem 3.18. \(\square \)
3.4 Step 4: Conclusion of the Proof of Theorem 1.12
This part of the proof requires substantial modifications compared to [14, 16], so we present it in detail.
Write \({\lambda }_1=(1+{\delta }){\lambda }\), say with
The iterative process discussed in the previous subsections produces a positive constant \(\beta = 2\,\log \Big [\frac{8k_N(1+{\lambda }_1)}{\sqrt{{\lambda }_1}-1}\Big ]\), and the following sequences of numbers
-
there exist positive integers \(L_1\) and \(m_{L_1}\) such that
$$\begin{aligned} m_{L_1}\le \beta \sum _{j=1}^{L_1} G(2^{j-1}){\quad \text{ and }\quad }2^{L_1-2}\le |T|, \end{aligned}$$with the property that
$$\begin{aligned} \text{ either }\quad \Vert x_{m_{L_1}}\Vert \le {\lambda }_1\,\Vert x-\Phi \Vert \quad \text{ or }\quad |T\cap {\Gamma }_{m_{L_1}}|\ge 2^{L_1-2}. \end{aligned}$$In the first case one stops; if not one iterates and applies Theorem 3.18 with \(k=m_{L_1}\), which implies
-
there exist positive integers \(L_2\) and \(m_{L_2}>m_{L_1}\) such that
$$\begin{aligned} m_{L_2}-m_{L_1}\le \beta \sum _{j=1}^{L_2} G(2^{j-1}){\quad \text{ and }\quad }2^{L_2-2}\le |T_{m_{L_1}}|, \end{aligned}$$(3.20)with the property that
$$\begin{aligned} \text{ either }\quad \Vert x_{m_{L_2}}\Vert \le {\lambda }_1\,\Vert x-\Phi \Vert \quad \text{ or }\quad |T_{m_{L_1}}\cap {\Gamma }_{m_{L_2}}|\ge 2^{L_2-2}. \end{aligned}$$Again, in the first case one stops; if not one applies iteratively Theorem 3.18, with values of \(k=m_{L_i}\), \(i=2,\ldots , s-1\), until some step s, where can one ensure that
-
there are positive integers \(L_s\) and \(m_{L_s}>m_{L_{s-1}}\) such that
$$\begin{aligned} m_{L_s}-m_{L_{s-1}}\le \beta \sum _{j=1}^{L_s} G(2^{j-1}){\quad \text{ and }\quad }2^{L_s-2}\le |T_{m_{L_{s-1}}}|, \end{aligned}$$(3.21)where
$$\begin{aligned} \left\{ \begin{array}{l} \text{ either }\quad \Vert x_{m_{L_s}}\Vert \le {\lambda }_1\,\Vert x-\Phi \Vert ,\quad \\ \text{ or } \quad |T_{m_{L_{s-1}}}\cap {\Gamma }_{m_{L_s}}|\ge 2^{L_s-2} \quad \text{ and }\quad m_{L_s}\ge 2\beta \,{{\widetilde{G}}}(2N). \end{array}\right. \end{aligned}$$(3.22)
Here G(n) denotes the sequence defined in (3.4), and the notation \({{\widetilde{G}}}(n)\) stands for the associated summing sequence as in (2.8).
In the first case of (3.22) one stops; if not, we shall show that the greedy algorithm actually covers the whole set T, that is
This would imply that \(x_{m_{L_s}}=0\), and so we would also stop.
Let us prove (3.23). Here we shall use the assumption that the sequence G(n) in (3.4) is increasing and 1-quasi-convex. Observe that
Now, by Lemma 2.9 and the inductive assumptions, see (3.20), for each \(i=1,\ldots ,s\), we have
with the notation \(m_{L_0}=0\). Thus, applying the (non-decreasing) function \(2\beta {{\widetilde{G}}}(2\cdot )\) to both sides of (3.24) and using part b) of Lemma 2.10 we obtain
using in the last line (3.25) and the second assertion in (3.22). Since \({{\widetilde{G}}}\) is increasing this implies
which proves (3.23).
Thus, the process will indeed end after \(m_{L_s}\) iterations. We now estimate this number using the remaining conditions in (3.21). Since the last inequality in (3.22) occurs for the first time at step s, we must have
Thus,
Therefore, using also part a) of Lemma 2.10, we see that (1.15) will be true with
as asserted in (1.14). \(\square \)
4 An Application: WCGA in \(L^p(\log \,L)^{\alpha }\) Spaces
4.1 Property \({\texttt {D}}(Q)\) in \(L^p(\log \,L)^{\alpha }\)
In this section we shall apply Theorem 1.12 in the case when
Following [1, Definition IV.6.11], this is the set of all measurable \(f:{{\mathbb {R}}}^d\rightarrow {{\mathbb {C}}}\) such that
These classes satisfy the elementary inclusions
We shall regard \({{\mathbb {X}}}\) as an Orlicz space \(L^\Phi \) associated with the function
which for a sufficiently large \(c>1\) is a (smooth) Young function.Footnote 1 The corresponding (Luxemburg) norm is then defined by
Let \(\Psi \) be the complementary functionFootnote 2 of \(\Phi (t)\). Then it is known that \((L^\Phi )^*=L^\Psi \) (isometrically, when the latter space is endowed with the Orlicz norm); see [1, Corollary IV.8.15]. In these examples it is not difficult to check that
and
see e.g. [5, Theorem I.7.2].
We recall how the norming functional \(F_f\) of a (normalized) element \(f\in L^\Phi \) is defined; see [5, Theorem 18.5]. Let \(P(t)=\Phi '(t)\), \(t\ge 0\), and for \(z\in {{\mathbb {C}}}\setminus \{0\}\) let \(P(z)=\overline{{\text {sign}}}(z)P(|z|)\). Then \(F_f\) is explicitly given by
In our case of interest we will have \(P(z)={\bar{z}}\,|z|^{p-2}\,\big (\log (c+|z|)\big )^{{\alpha }p}\), \(z\in {{\mathbb {C}}}\), and hence
with \(A(f)=\int _{{{\mathbb {R}}}^d}|f|^{p}\,(\log (c+|f|))^{{\alpha }p}\,dx\).
The moduli of smoothness and convexity for Orlicz spaces \(L^\Phi \) have been studied in [3, 8]. According to [8, Theorem 1], there exists a Young function \(\bar{\Phi }\), equivalent to \(\Phi \), such that
under suitable doubling conditions in \(\Phi (t)\) and \(\Psi (t)\) (which are always held in the cases considered in (4.1) and (4.2)). Moreover, our specific examples satisfy the regularity conditions stated in [3, Proposition 19], so one may actually take \(\bar{\Phi }=\Phi \).
Therefore, inserting into (4.4) the expressions for \(\Phi \) and \(\Psi \) from (4.1) and (4.2), and performing some straightforward computations, one obtains the following result. Here we use the standard notation \({\alpha }={\alpha }_+-{\alpha }_-\), where
Proposition 4.5
Let \(1<p<\infty \) and \({\alpha }\in {{\mathbb {R}}}\), and let \({{\mathbb {X}}}=L^p(\log L)^{\alpha }\) as above. Then, for the Luxemburg norm associated with \(\Phi (t)\) in \({{\mathbb {X}}}\) it holds
-
if \(p>2\) then
$$\begin{aligned} \rho _{{{\mathbb {X}}}}(t)\,\lesssim \, t^2{\quad \text{ and }\quad }{\delta }_{{{\mathbb {X}}}^*}(s)\,\gtrsim \,s^2, \end{aligned}$$ -
if \(1<p\le 2\) then
$$\begin{aligned} \rho _{{{\mathbb {X}}}}(t)\,\lesssim \, t^p\,(\log (e+\tfrac{1}{t}))^{p\,{\alpha }_-}{\quad \text{ and }\quad }{\delta }_{{{\mathbb {X}}}^*}(s)\,\gtrsim \, \frac{s^{p'}}{(\log (e+\frac{1}{s}))^{p'\,{\alpha }_-}}. \end{aligned}$$
In view of the discussion in Sect. 2.2, we then obtain the following.
Corollary 4.6
Let \(1<p<\infty \) and \({\alpha }\in {{\mathbb {R}}}\), and let \({{\mathbb {X}}}=L^p(\log L)^{\alpha }\) as above. Then, for every normalized dictionary \({{\mathcal {D}}}\) in \({{\mathbb {X}}}\), property \({\texttt {D}}(Q)\) holds with
for a suitably small constant \(c_0>0\).
4.2 The Haar System in \(L^p(\log \,L)^{\alpha }\)
Next we consider the dictionary \({{\mathcal {D}}}=\{\psi _j\}_{j=1}^\infty \) in \({{\mathbb {X}}}=L^p(\log L)^{\alpha }\) given by the normalized Haar basis in \({{\mathbb {R}}}^d\) (or any sufficiently smooth wavelet basis). These bases are unconditional, so in this case property \({\texttt {A2}}(k_N,D)\) will hold with \(k_N=O(1)\) (and any \(D<\infty \)). It remains to verify property \({\texttt {A3}}(H,D)\). As mentioned in Lemma 2.6, this property holds with
(also for all \(D<\infty \)). Here \(\{\psi ^*_j\}\) is the dual dictionary, which is again the Haar basis, this time normalized in \({{\mathbb {X}}}^*\). Since the basis is unconditional, the parameter H(N) is equivalent to the upper democracy function of the dual space \({{\mathbb {X}}}^*\), that is
Democracy functions, for the Orlicz classes \(L^\Phi \), were studied in [4], where it was proved that, if the Boyd indices of \(\Phi \) are non trivial, then
where \({\varphi }(t):=1/\Phi ^{-1}(1/t)\) is the fundamental function of \(L^\Phi \).
In our case of interest, where \(\Phi (t)\) satisfies (4.1), we have
see [4, Proposition 3.4]. Thus, for the dual space \({{\mathbb {X}}}^*=L^\Psi \) with \(\Psi \) as in (4.2) we have
Overall we conclude that property \({\texttt {A3}}(H)\) holds with
Thus, combining (4.7) and (4.8), we see that, for \(p>2\) we have
while for \(1<p\le 2\) we have
4.3 Proof of Theorem 1.17.a
Collecting the values of the parameters G(N) obtained in the previous subsection, and inserting them into Theorem 1.12, we deduce the first assertions (1.18) and (1.19) in Theorem 1.17.
4.4 Proof of Theorem 1.17.b
We shall use the following result whose proof can be found in [4, Lemma 3.1]. For simplicity in the notation, we assume in this section that the underlying space \({{\mathbb {R}}}^d\) has dimension \(d=1\).
Lemma 4.9
Let \({{\mathbb {X}}}=L^\Phi ({{\mathbb {R}}})\) be an Orlicz space with non-trivial Boyd indices, and let \({{\mathcal {D}}}=\{h_I\}\) be the (normalized) Haar basis in \({{\mathbb {X}}}\). Then, if A is a finite collection of disjoint dyadic intervals with the same size s, then
where \({\varphi }(t)\) is the fundamental function of \({{\mathbb {X}}}\).
We now show the lower bound for the function \(\psi (N)\) stated in (1.20).
Proof of (1.20)
Write \({{\mathcal {D}}}=\{h_I\}\) where \(h_I\) is the (normalized) Haar function supported in I, and I runs over all dyadic intervals in \({{\mathbb {R}}}\). Pick any two collections A and B, of pairwise disjoint dyadic intervals with cardinalities \(|A|=N\) and \(|B|=M\), such that
For instance, we could take
with \(M=2^m\). For \(b>0\) to be determined, consider the function
Using Lemma 4.9 and \({\varphi }(t)\,\approx \,t^{1/p}\,\big (\log (e+\frac{1}{t})\big )^{\alpha }\), observe that
Also, since \(\Vert h_I\Vert _{L^\Phi }=1\) we have
In particular, if \(I\in B\) we have
and similarly, if \(I\in A\) we have
Using the formula for the norming functional in (4.3) we see that
for some \(C(f)>0\). In view of (4.12) and (4.13), the logarithmic factors inside the integrals are approximately constant, so can be disregarded. Also, (4.11) implies
so we have
Thus, the above quantities are approximately the same provided we choose
Therefore, if \(c_1>0\) is chosen properly, the WCGA, \({\mathscr {G}}_n(f)\), can be formed either by selecting consecutive elements I from A (if \(n\le N\)), or by selecting consecutive elements I from B (if \(n\le M/2\)). To verify these assertions one should note that the equivalences in (4.14) remain also trueFootnote 3 when f is replaced by the remainder \(f-{\mathscr {G}}_n(f)\).
So suppose now that (1.20) holds. If \({\alpha }\ge 0\), we let \(N=\psi (M)\), and in view of the previous comment we can select \(c_1\) such that \({\mathscr {G}}_N(f)=f_1\). Then
which in view of (4.15) and (4.10) implies
This proves the assertion in the Theorem when \({\alpha }\ge 0\).
If \({\alpha }\le 0\), then we take \(M=2\psi (N)\), and select \(c_1\) such that \({\mathscr {G}}_{M/2}(f)=b\sum _{I\in B'}h_I\), for some \(B'\subset B\) with \(|B'|=M/2\). Then
which this time implies
Solving for M this gives
This establishes (1.20), and therefore completes the proof of Theorem 1.17. \(\square \)
5 WCGA for Trigonometric System in \(L^p(\log \,L)^{\alpha }\)
In this section we give a second application of Theorem 1.12, this time to the trigonometric system in the torus \({{\mathbb {T}}}\equiv [-\pi ,\pi )\), that is,
So, from now on, all functions \(f\in L^p(\log \,L)^{\alpha }\) are understood as defined in \({{\mathbb {T}}}\). Otherwise, we regard \(L^p(\log \,L)^{\alpha }\) as an Orlicz space \(L^\Phi \) in the same sense as in Sect. 4. Since [8] covers also this setting, the estimates for the moduli of convexity and smoothness in Proposition 4.5 remain true, and so does the estimate (4.7) for the function Q(s) in Corollary 4.6.
We still have to compute the parameters \(k_N\) and H(N). To do so, we shall make use of the following interpolation lemma.
Lemma 5.1
Consider the Young function \({\bar{\Phi }}(t)=t^p\,\big (\log (c+t)\big )^{{\alpha }p}\), for some \(c\ge e\). Assume that
Then,
Proof
We may assume that \(\Vert f\Vert _2=1\). Define the functions
By the lattice property of the Luxemburg norm in \(L^{\bar{\Phi }}\) we have
using that b(t) is increasing under the conditions in (5.2). So, it suffices to show that
as this will imply that \(\big \Vert a(f)\big \Vert _{L^{\bar{\Phi }}}\le 1\). Write
Observe that, regardless of the sign of \({\alpha }\in {{\mathbb {R}}}\), we always have
Thus,
\(\square \)
Remark 5.3
Observe that, when the indices p and \({\alpha }\) satisfy (5.2), then it holds
This is easily proved using that \(t^2\lesssim \Phi (t)\) for \(t\ge 1\), since
Likewise, by duality, one proves that \(L^{2}({{\mathbb {T}}})\hookrightarrow L^p(\log L)^{\alpha }\) when
5.1 Property \({\texttt {A3}}\) for \({{\mathcal {T}}}\) in \(L^p(\log L)^{\alpha }\)
Lemma 5.6
Let \(1<p<\infty \) and \({\alpha }\in {{\mathbb {R}}}\). Then, for all \(|{\varepsilon }_n|\le 1\) and all \(A\subset {{\mathbb {Z}}}\) with \(|A|\le N\) it holds
Proof
When p and \({\alpha }\) satisfy (5.5), the right hand side of (5.7) is \(\approx N^{1/2}\), so the assertion follows from the inclusion \(L^{2}\hookrightarrow L^p(\log L)^{\alpha }\). On the other hand, if p and \({\alpha }\) satisfy (5.2), then by Lemma 5.1 we have
where \(b(t)=t^{1-\frac{2}{p}}\,\big (\log (c+t^\frac{2}{p})\big )^{{\alpha }}\). Applying this to \(f=\sum _{n\in A}{\varepsilon }_n e^{inx}\), and using that b(t) is increasing and
one easily obtains (5.7). \(\square \)
Remark 5.9
The upper bounds in (5.7) cannot be improved, even when all signs \({\varepsilon }_n=1\). Indeed, if one considers the Dirichlet kernel \(D_N(x)=\sum _{|n|\le N} e^{inx}\), then we have
see e.g. [9, Lemma 3.1]. On the other hand, if A is a lacunary set (say, \(A=\{2^j\}_{j=1}^N\)), then
Indeed, this is easily obtained from a similar result for all the \(L^q\) spaces, \(0<q<\infty \), and the inclusions \(L^{p+{\varepsilon }}\hookrightarrow L^p(\log L)^{\alpha }\hookrightarrow L^{p-{\varepsilon }}\).
Corollary 5.11
Let \(1<p<\infty \) and \({\alpha }\in {{\mathbb {R}}}\). Let \({{\mathbb {X}}}=L^p(\log L)^{\alpha }\) and \({{\mathcal {D}}}=\{e^{inx}\}_{n\in {{\mathbb {Z}}}}\) in \({{\mathbb {T}}}\). Then, property \({\texttt {A3}}(H)\) holds with
Proof
Apply Lemmas 2.6 and 5.6, and the duality relation \({{\mathbb {X}}}^*=L^{p'}(\log L)^{-{\alpha }}\). \(\square \)
5.2 Property \({\texttt {A2}}\) for \({{\mathcal {T}}}\) in \(L^p(\log L)^{\alpha }\)
Given a finite set \(A\subset {{\mathbb {Z}}}\), we denote
where \({{\hat{g}}}(n)\), \(n\in {{\mathbb {Z}}}\), are the Fourier coeffients of \(g\in L^1({{\mathbb {T}}})\). As noticed in [2, Lemma 2.15], property \({\texttt {A2}}(k_N)\) holds trivially when we let
In this section we compute this last expression.
Lemma 5.13
Let \(2<p<\infty \) and \({\alpha }\in {{\mathbb {R}}}\), or \(p=2\) and \({\alpha }\ge 0\). Then,
Proof
Let \(g\in L^p(\log L)^{\alpha }\). Using the inequality in (5.8) from the previous section, applied to \(f=S_A(g)\) we see that
Now,
so using that b(t) is increasing we obtain
On the other hand, the inclusion in (5.4) gives
Thus, we obtain
\(\square \)
Since \(S_A^*=S_A\), by duality one obtains the following complementary result.
Lemma 5.15
Let \(1<p<2\) and \({\alpha }\in {{\mathbb {R}}}\), or \(p=2\) and \({\alpha }\le 0\). Then,
Remark 5.17
The estimate in (5.16) is best possible (and by duality, also (5.14)). One can prove this by noticing that there exist choices of signs \(\pm 1\) such that
This last assertion can be easily obtained from a similar property of the \(L^q\)-spaces, and the inclusions at the end of Remark 5.9. From (5.18), there will be a set \(A\subset [-N,N]\), either corresponding to the positive or the negative signs, so that
Thus, omitting the subindices \(L^p(\log L)^{\alpha }\) from the norms, we have
using (5.10) in the last step.
Corollary 5.19
Let \(1<p<\infty \) and \({\alpha }\in {{\mathbb {R}}}\). Let \({{\mathbb {X}}}=L^p(\log L)^{\alpha }\) and \({{\mathcal {D}}}=\{e^{inx}\}_{n\in {{\mathbb {Z}}}}\) in \({{\mathbb {T}}}\). Then, property \({\texttt {A2}}(k_N)\) holds with
Proof
Apply Lemmas 5.13 and 5.15 to the expression in (5.12). \(\square \)
5.3 WCGA for \({{\mathcal {T}}}\) in \(L^p(\log L)^{\alpha }\)
Combining the estimates from the previous subsections, we obtain the following.
Theorem 5.20
Let \(1<p<\infty \) and \({\alpha }\in {{\mathbb {R}}}\). Let \({{\mathbb {X}}}=L^p(\log L)^{\alpha }\) and \({{\mathcal {D}}}=\{e^{inx}\}_{n\in {{\mathbb {Z}}}}\) in \({{\mathbb {T}}}\). Then, there exists a constant \(C>1\) such that the WCGA satisfies
where
Proof
Combine Theorem 1.12, with the estimates for Q(t), H(N) and \(k_N\) in Corollaries 4.6, 5.11 and 5.19. \(\square \)
Remark 5.21
The necessity of the log factors and the powers in the above expression of \(\phi (N)\) is not known, even in the case \({\alpha }=0\) (except, of course, if \({{\mathbb {X}}}=L^2\)). See [16, Open Question 8.2].
Notes
In the latter case, we have restricted to \(n\le M/2\) to ensure that (4.12) continues to hold when f is replaced by \(f-{\mathscr {G}}_n(f)\). Indeed, in such case one would use that \(\Vert \sum _{I\in B'}h_I\Vert \approx 1/{\varphi }(1/M)\), when \(B'\subset B\) with \(|B'|\ge M/2\), by Lemma 4.9.
References
Benett, C., Sharpley, R.C.: Interpolation of Operators. Academic Press, London (1988)
Dilworth, S., Garrigós, G., Hernández, E., Kutzarova, D., Temlyakov, V.: Lebesgue-type inequalities in greedy approximation. J. Funct. Anal. 280(5), 108885 (2021)
Figiel, T.: On the moduli of convexity and smoothness. Stud. Math. 56, 121–155 (1976)
Garrigós, G., Hernández, E., Martell, J.M.: Wavelets, Orlicz spaces, and greedy bases. Appl. Comput. Harmon. Anal. 24(1), 70–93 (2008)
Krasnosel’skii, M., Rutickii, J.: Convex Functions and Orlicz Spaces. Noordhoff Ltd., Groningen (1961)
Lindenstrauss, J., Tzafriri, L.: Classical Banach Spaces, vol II. Springer, Berlin (1979)
Livshitz, E., Temlyakov, V.: Sparse approximation and recovery by greedy algorithms. IEEE Trans. Inf. Theory 60(7), 3989–4000 (2014)
Maleev, R., Troyanski, S.: On the moduli of convexity and smoothness in Orlicz spaces. Stud. Math. 54(2), 131–141 (1975)
Pawlewicz, A., Wojciechowski, M.: Marcinkiewicz sampling theorem for Orlicz spaces. Positivity 26(3), Paper No. 56 (2022)
Rao, M.M., Ren, Z.D.: Theory of Orlicz Spaces. Monographs and Textbooks in Pure and Applied Mathematics, vol. 146. Marcel Dekker, New York (1991)
Singer, I.: Bases in Banach Spaces I. Springer, Berlin (1970)
Temlyakov, V.N.: Greedy algorithms in Banach spaces. Adv. Comput. Math. 14(3), 277–292 (2001)
Temlyakov, V.N.: Greedy Approximation. Cambridge University Press, Cambridge (2011)
Temlyakov, V.N.: Sparse approximation and recovery by greedy algorithms in Banach spaces. Forum Math. Sigma 2(12), 26 (2014)
Temlyakov, V.N.: Sparse Approximation with Bases. Advanced Courses in Mathematics. CRM Barcelona, Birkhäuser, Springer, Basel (2015)
Temlyakov, V.N.: Multivariate Approximation. Cambridge University Press, Cambridge (2018)
Acknowledgements
Research partially supported by Grant PID2019-105599GB-I00 from Ministerio de Ciencia e Innovación (Spain), and Grants 20906/PI/18 and 21955/PI/22 from Fundación Séneca (Región de Murcia, Spain). The author wishes to thank E. Hernández and D. Kutzarova for useful comments at different stages of this work. The author also thanks two anonymous referees for their careful reading and several useful comments.
Funding
Open Access funding provided thanks to the CRUE-CSIC agreement with Springer Nature.
Author information
Authors and Affiliations
Corresponding author
Additional information
Communicated by Vladimir N. Temlyakov.
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Garrigós, G. The WCGA in \(L^p(\log L)^{\alpha }\) Spaces. Constr Approx (2023). https://doi.org/10.1007/s00365-023-09664-y
Received:
Revised:
Accepted:
Published:
DOI: https://doi.org/10.1007/s00365-023-09664-y
Keywords
- Non-linear approximation
- Greedy algorithm
- Uniformly smooth Banach space
- Orlicz space
- Haar system
- Trigonometric system