1 Introduction

We study the smoothness properties of roots of nonnegative Hölder continuous functions on [0, 1]. More precisely, we derive sufficient flatness-type conditions such that if f is in a Hölder space with index \(\beta \), then \(f^\alpha \) has Hölder regularity \(\alpha \beta \) for \(0< \alpha \le 1\).

This question was first studied in the square root case (\(\alpha =1/2\)) by Glaeser [11] and Dieudonné [7], who showed that any nonnegative, twice continuously differentiable function on \(\mathbb {R}\) admits a continuously differentiable square root. This result is sharp in general in the sense that there exist infinitely differentiable functions such that no admissible square root has Hölder index larger than one (Theorem 2.1 in [1]). Flatness conditions thus become necessary for \(\beta >2\) in order to ensure that a \(\beta \)-smooth function has an admissible square root with Hölder index greater than one. Some extensions of the result of Glaeser can be found in Lengyel [13] and Mandai [15]. Macchia [14], Reichard [23], Rainer [21] and Colombini et al. [6] studied what regularity \(f^{1/r}\) has in a neighbourhood of a zero of f, provided that enough derivatives of f vanish at that point.

The key property for additional regularity of roots is flatness, in the sense that whenever f is small then so are its derivatives, and in a series of recent papers, Bony et al. [1,2,3] consider flatness conditions for admissible square roots. Recall that g is an admissible square root of f if \(f=g^2\) (allowing g to switch signs). The additional freedom to pick an admissible square root does not always help, however. Suppose f possesses a converging sequence of local minima \((x_n)_n\) such that \(f(x_n)\) is strictly positive for all n and \(f(x_n)\rightarrow 0\) as \(n\rightarrow \infty .\) The nonnegative square root is then the most regular admissible square root, and there exist infinitely differentiable functions of this form with no admissible square root having Hölder index larger than one [1]. In this article, we restrict to studying roots of nonnegative functions that do not change sign, which we without loss of generality take to be positive.

The main result of this article is stated below, with \(\Vert \cdot \Vert _{C^\beta ([0,1])}\) and \(\Vert \cdot \Vert _{\mathcal {F}^\beta ([0,1])}\) denoting the usual and “flat” Hölder norms on [0, 1], respectively, both of which are defined in Sect. 2.

Theorem 1

For \(\alpha \in (0,1]\), \(\beta >0\) and all nonnegative \(f\in \mathcal {F}^\beta ([0,1])\),

$$\begin{aligned} \Vert f^\alpha \Vert _{C^{\alpha \beta }([0,1])} \le \Vert f^\alpha \Vert _{\mathcal {F}^{\alpha \beta }([0,1])} \le C(\alpha ,\beta ) \Vert f\Vert _{\mathcal {F}^\beta ([0,1])}^\alpha . \end{aligned}$$

In particular, we see that \(f\in \mathcal {F}^\beta \) is a sufficient condition for \(\sqrt{f}\in C^{\beta /2}\), thereby overcoming the usual limitation that \(\sqrt{f}\) need be no more than \(C^1\) in general. For \(\beta \le 1\), bounds on the Hölder or Sobolev norms of \(f^{1/r}\) can be found in Glaeser [11] (\(r=2\)), Colombini and Lerner [5] and Ghisi and Gobbino [9]. For bounds for the roots of more general polynomials see the recent work of Parusiński and Rainer [17,18,19].

The condition \(f\in \mathcal {F}^\beta \) requires a certain notion of “flatness” on the derivatives of f that is different from that considered in [1,2,3]. In particular, it contains functions not covered by their results for which Theorem 1 nonetheless applies [see (2.2) for a simple example]. In Sect. 2, we study the Hölder cone of nonnegative functions satisfying such a flatness constraint and compare it to existing notions of flatness in the literature. We show that the present flatness constraints are only non-trivial for Hölder indices larger than two (Theorem 4).

We also derive bounds on the wavelet coefficients of \(f^\alpha \) that provide additional insight into the local regularity of \(f^\alpha \) (Proposition 1). Note that if \(f\in C^\beta \) is uniformly bounded away from zero then \(f^\alpha \in C^\beta \) (see Lemma 2). This is substantially better than the regularity provided by Theorem 1, which is driven by small function values. The wavelet bounds reflect the local function size and allow one to account for both of these regimes, thereby giving a finer account of the regularity of \(f^\alpha \).

An application of this approach is in nonparametric statistics, where one aims to reconstruct a Hölder function f when observing a noisy version of \(\sqrt{f}\) (cf. [16]). In the context of this problem, one must necessarily restrict to positive square roots and it is not enough to find flatness conditions ensuring that \(\sqrt{f}\) has some regularity: some control of the Hölder norm is also required [22].

2 Flat Hölder smoothness

2.1 Definition and comparison with other flatness constraints

Let \(\lfloor \beta \rfloor \) denote the largest integer strictly smaller than \(\beta .\) For \(\beta >0\) and \(\Omega \subset \mathbb {R}\), define

$$\begin{aligned} |f|_{C^\beta (\Omega )} = \sup _{x\ne y, x,y\in \Omega } \frac{|f^{(\lfloor \beta \rfloor )}(x) - f^{(\lfloor \beta \rfloor )}(y)| }{|x-y|^{\beta - \lfloor \beta \rfloor }} \end{aligned}$$

and let \(C^\beta (\Omega ) = \{ f: \Omega \rightarrow \mathbb {R}\ : \ f^{(\lfloor \beta \rfloor )} \text { exists}, \ \Vert f \Vert _{C^\beta (\Omega )} < \infty \}\) denote the space of \(\beta \)-Hölder continuous functions on \(\Omega \), where \(\Vert f \Vert _{C^\beta (\Omega )} = \Vert f\Vert _\infty + \Vert f^{(\lfloor \beta \rfloor )} \Vert _\infty + |f|_{C^\beta (\Omega )}\). This space is often denoted by \(C^{\lfloor \beta \rfloor , \beta -\lfloor \beta \rfloor }(\Omega ).\) For convenience, we also write \(C^\beta = C^\beta ([0,1])\).

For \(f\in C^\beta ,\) define \(|f|_{\mathcal {F}^\beta }\) to be the infimum over all nonnegative real numbers \(\kappa \) satisfying

$$\begin{aligned} |f^{(j)}(x)|^\beta \le \kappa ^j |f(x)|^{\beta -j} , \ \ \forall x\in [0,1] , \ \ \forall \ 1\le j<\beta , \end{aligned}$$
(2.1)

with \(|f|_{\mathcal {F}^\beta }=0\) for \(\beta \le 1\). This can be written concisely as

$$\begin{aligned} | f |_{\mathcal {F}^\beta } = \max _{1 \le j<\beta } \left( \sup _{x\in [0,1]} \frac{|f^{(j)}(x)|^\beta }{|f(x)|^{\beta -j}} \right) ^{1/j} = \max _{1 \le j <\beta } \Big \Vert |f^{(j)}|^\beta /f^{\beta -j} \Big \Vert _\infty ^{1/j}. \end{aligned}$$

The quantity \(| f |_{\mathcal {F}^\beta }\) measures the flatness of a function near zero. In particular, if f vanishes at some point, then so do all its derivatives. Define

$$\begin{aligned} \Vert f\Vert _{\mathcal {F}^\beta } = \Vert f\Vert _{C^\beta ([0,1])} + |f|_{\mathcal {F}^\beta } \end{aligned}$$

and set

$$\begin{aligned} \mathcal {F}^\beta = \{ f\in C^\beta \ : \ f\ge 0, \ \Vert f\Vert _{\mathcal {F}^\beta }<\infty \}. \end{aligned}$$

This space contains for example the constant functions, any \(C^\beta \)-function bounded away from 0 and those of the form \(f(x)=(x-x_0)^\beta g(x)\) for \(g \ge \varepsilon > 0\) infinitely differentiable.

Let us compare condition (2.1) to the flatness conditions considered in the literature on admissible square roots. For a given function f,  suppose that there exists a continuous function \(\gamma \), vanishing on the set of flat points of f, such that for any positive minimum \(x_0\) of f, \(f''(x_0) \le \gamma (x_0) f(x_0)^{1/2}\). Theorem 3.5 of Bony et al. [1] establishes that this is a necessary and sufficient condition for a four times continuously differentiable function to have a twice continuously differentiable, admissible square root. This is comparable to the flatness seminorm (2.1), which for \(\beta =4\) also gives that \(f''\) should be bounded by \(f^{1/2}.\) Nevertheless, in order to obtain bounds on the norm of \(\sqrt{f},\) constraints on the third derivative of f must also be imposed in our framework.

One can exploit extra regularity by assuming that f and its derivatives vanish at all local minima (e.g., in [1,2,3] for admissible square roots). However, we take a different approach permitting functions to take small nonzero values. In fact, as mentioned in [1], the obstacle preventing the existence of a twice continuously differentiable square root for a general nonnegative four times continuously differentiable function is a converging sequence of small nonzero minima. A sufficient condition for arbitrary even integer \(\beta \ge 4\) was found in Theorem 2.2 in [3]: a \(\beta \)-times continuously differentiable function f has a \(\beta /2\)-times continuously differentiable square root if at each local minimum of f, the function and its derivatives up to order \(\beta -4\) vanish. While this constraint need only hold for the local minima of f, forcing all local minima to be exactly zero can be overly restrictive. Indeed, a converging sequence of small nonzero minima can be allowed if it does so in a controlled way. We now give an example which is not covered by [3], but satisfies (2.1) and hence the conditions of Theorem 1.

Let K be a smooth nonnegative function supported on [0, 1] and positive on (0, 1), for example \(K(x) = \exp \left( -\tfrac{1}{x(1-x)}\right) 1_{(0,1)}(x)\), and set \(K_\sigma (x) = K(x/\sigma )\) for \(\sigma >0\). Consider the function

$$\begin{aligned} f(x) = x^\beta - \varepsilon \sum _{j=1}^{\infty } 1_{\left[ 2^{-j-1},2^{-j} \right] }(x) 2^{-(j+1)\beta } K_\sigma (2^{j+1}x-1), \end{aligned}$$
(2.2)

where \(0<\sigma < 2^{\beta -2}\beta ^{-1} \Vert K'\Vert _\infty /\Vert K\Vert _\infty \) and \(\beta \sigma 2^{1-\beta }/\Vert K'\Vert _\infty < \varepsilon \le 1/(2\Vert K\Vert _\infty )\). Let \(x\in [2^{-j-1},2^{-j}]\). Using the upper bound for \(\varepsilon \), it can be checked that \(f(x) \ge 2^{-1-(j+1)\beta }>0\). Using this inequality, it holds that for \(k < \beta \),

$$\begin{aligned} |f^{(k)}(x)| \le C(\beta ,k) 2^{-j(\beta -k)} + \varepsilon 2^{-(j+1)(\beta -k)} \sigma ^{-k} \Vert K^{(k)}\Vert _\infty \le C(\beta ,k,\sigma ,K) f(x)^{\frac{\beta -k}{\beta }}. \end{aligned}$$

Since \(f\in C^\beta \), it follows that \(\Vert f\Vert _{\mathcal {F}^\beta } \le C(\beta ,\sigma ,K)<\infty \). We now show that f has an infinite sequence of nonzero local minima tending to zero. Since \(f'\) is continuous and \(f'(2^{-j}) = \beta 2^{-j(\beta -1)}>0\) for all \(j\ge 1\), it suffices to prove that every interval \([2^{-j-1},2^{-j}]\) contains a point \(x_j\) with \(f'(x_j) <0\). Differentiating f and rearranging the desired inequality show that a sufficient condition for the existence of such a point is \(\beta < \varepsilon 2^{-(\beta -1)} \sigma ^{-1} \Vert K'\Vert _\infty \), which is equivalent to the lower bound for \(\varepsilon \).

In conclusion, we have shown that f possesses (infinitely many) nonzero local minima, so that the conditions of Theorem 2.2 of [3] are not satisfied, but that \(f \in \mathcal {F}^\beta \) so that Theorem 1 nonetheless applies.

2.2 Basic properties

In a slight abuse of notation, we say that \(\Vert \cdot \Vert \) is a norm on a convex cone A if \(\Vert v\Vert \ge 0\) and \(\Vert v\Vert =0\) if and only if \(v=0,\) \(\Vert \lambda v\Vert = \lambda \Vert v\Vert \) for all \(\lambda >0,\) and \(\Vert v+w\Vert \le \Vert v\Vert +\Vert w\Vert \) for all \(v, w\in A.\) We have thus replaced absolute homogeneity by positive homogeneity. Similarly, we say that \(|\cdot |\) is a seminorm on A if \(|v|\ge 0\) for all \(v\in A\) and both positive homogeneity and the triangle inequality hold on A.

Theorem 2

The space \(\mathcal {F}^\beta \) is a convex cone on which \(\Vert \cdot \Vert _{\mathcal {F}^\beta }\) defines a norm. Moreover, \(\Vert fg\Vert _{\mathcal {F}^\beta } \le C(\beta ) \Vert f\Vert _{\mathcal {F}^\beta } \Vert g\Vert _{\mathcal {F}^\beta }\) for any \(f,g\in \mathcal {F}^\beta \).

Proof

For the first part, it is enough to prove that \(|f|_{\mathcal {F}^\beta }\) is a seminorm. Observe that \(|af|_{\mathcal {F}^\beta } = a|f|_{\mathcal {F}^\beta }\) for \(a>0.\) The triangle inequality \( |f+g|_{\mathcal {F}^\beta }\le \ |f|_{\mathcal {F}^\beta }+\ |g|_{\mathcal {F}^\beta }\) can be restated as

$$\begin{aligned} \big | f^{(j)}(x) + g^{(j)}(x) \big | \le \big ( |f|_{\mathcal {F}^\beta } + |g|_{\mathcal {F}^\beta }\big )^{\frac{j}{\beta }}(f(x)+g(x))^{\frac{\beta -j}{\beta }}, \quad \text {for all} \ x\in [0,1] \end{aligned}$$

and \(j=1,\ldots , \lfloor \beta \rfloor \). We may assume that \(f(x),g(x), |f|_{\mathcal {F}^\beta }, |g|_{\mathcal {F}^\beta }>0,\) since otherwise the result is trivial. With \(r(x) :=f(x)/(f(x)+g(x))\) and \(\gamma := |f|_{\mathcal {F}^\beta }/ (|f|_{\mathcal {F}^\beta }+ |g|_{\mathcal {F}^\beta }),\) and applying Jensen’s inequality to the concave function \(x\mapsto x^{\frac{j}{\beta }},\)

$$\begin{aligned}&\big | f^{(j)}(x) + g^{(j)}(x) \big | \\&\le \ |f|_{\mathcal {F}^\beta }^{\frac{j}{\beta }} f(x)^{\frac{\beta -j}{\beta }} + |g|_{\mathcal {F}^\beta }^{\frac{j}{\beta }} g(x)^{\frac{\beta -j}{\beta }} \\&= \Big [r(x) \Big ( \frac{\gamma }{r(x)}\Big )^{\frac{j}{\beta }}+(1-r(x))\Big (\frac{1-\gamma }{1-r(x)}\Big )^{\frac{j}{\beta }}\Big ] \big ( |f|_{\mathcal {F}^\beta } + |g|_{\mathcal {F}^\beta }\big )^{\frac{j}{\beta }}(f(x)+g(x))^{\frac{\beta -j}{\beta }} \\&\le \big ( |f|_{\mathcal {F}^\beta } + |g|_{\mathcal {F}^\beta }\big )^{\frac{j}{\beta }}(f(x)+g(x))^{\frac{\beta -j}{\beta }}. \end{aligned}$$

For the second statement, we establish that each term in the norm \(\Vert fg\Vert _{\mathcal {F}^\beta }\) is bounded by a constant times \(\Vert f\Vert _{\mathcal {F}^\beta } \Vert g\Vert _{\mathcal {F}^\beta }\). For \(\beta \in (0,1]\), the result follows immediately, so assume that \(\beta >1\). Using that \(x^{1/\beta } + y^{1/\beta } \le 2^{1-1/\beta } (x+y)^{1/\beta }\) for any \(x,y\ge 0\),

$$\begin{aligned} \left| (fg)^{(j)}(x) \right| ^\beta&= \left| \sum _{r=0}^j {j \atopwithdelims ()r} f^{(r)}(x) g^{(j-r)}(x) \right| ^\beta \\&\le \left| \sum _{r=0}^j {j \atopwithdelims ()r} |f|_{\mathcal {F}^\beta }^{\frac{r}{\beta }} |f(x)|^\frac{\beta -r}{\beta } |g|_{\mathcal {F}^\beta }^\frac{j-r}{\beta } |g(x)|^\frac{\beta -j+r}{\beta } \right| ^\beta \\&= \left( |f|_{\mathcal {F}^\beta }^\frac{1}{\beta } |g(x)|^\frac{1}{\beta } + |g|_{\mathcal {F}^\beta }^\frac{1}{\beta } |f(x)|^\frac{1}{\beta } \right) ^{j\beta } |f(x)g(x)|^{\beta -j} \\&\le \left( 2^{\beta -1} (|f|_{\mathcal {F}^\beta } \Vert g\Vert _\infty + |g|_{\mathcal {F}^\beta } \Vert f\Vert _\infty ) \right) ^j |f(x)g(x)|^{\beta -j} , \end{aligned}$$

from which we deduce that \(|fg|_{\mathcal {F}^\beta } \le 2^{\beta -1}\Vert f\Vert _{\mathcal {F}^\beta } \Vert g\Vert _{\mathcal {F}^\beta }\) using (2.1) and also that \(\Vert (fg)^{(\lfloor \beta \rfloor )}\Vert _\infty \le 2^{\beta -1}\Vert f\Vert _{\mathcal {F}^\beta } \Vert g\Vert _{\mathcal {F}^\beta }\). By the triangle inequality and arguing similarly to (3.5) for each term in the sum below, \(|(fg)^{(\lfloor \beta \rfloor )}(x) -(fg)^{(\lfloor \beta \rfloor )}(y)|\) is bounded by

$$\begin{aligned}&\sum _{r=0}^{\lfloor \beta \rfloor } {\lfloor \beta \rfloor \atopwithdelims ()r} \left( |f^{(r)}(x)-f^{(r)}(y)| |g^{(\lfloor \beta \rfloor -r)}(x)| + |g^{(\lfloor \beta \rfloor -r)}(x) - g^{(\lfloor \beta \rfloor -r)}(y)| |f^{(r)}(y)| \right) \\&\quad \quad \le C(\beta ) \Vert f\Vert _{\mathcal {F}^\beta } \Vert g\Vert _{\mathcal {F}^\beta } |x-y|^{\beta - \lfloor \beta \rfloor }, \end{aligned}$$

whence \(|fg|_{C^\beta }\le C(\beta )\Vert f\Vert _{\mathcal {F}^\beta } \Vert g\Vert _{\mathcal {F}^\beta }\). Since \(\Vert fg\Vert _\infty \le \Vert f\Vert _\infty \Vert g\Vert _\infty \), this completes the proof. \(\square \)

Remark 1

The space \(\mathcal {F}^\beta \) inherits a notion of completeness from \(C^\beta \). Suppose that \((f_n)_n \subset \mathcal {F}^\beta \) is a Cauchy sequence in \(C^\beta \) and \(\limsup _{n\rightarrow \infty } \Vert f_n\Vert _{\mathcal {F}^\beta }<\infty .\) Then, the \(C^\beta \)-limit f of \((f_n)\) satisfies \(f\in \mathcal {F}^\beta \) and \(\Vert f_n\Vert _{\mathcal {F}^\beta } \rightarrow \Vert f\Vert _{\mathcal {F}^\beta }.\)

To see the difference between the classical Hölder space \(C^\beta \) and \(\mathcal {F}^\beta ,\) consider the functions \(f_\gamma : [0,1]\rightarrow \mathbb {R},\) \(f_\gamma (x) =x^\gamma ,\) \(\gamma >0.\) It is well known that \(f_\gamma \in C^\beta \) if and only if either \(\gamma \notin \mathbb {N}\) and \(\beta \le \gamma \) or \(\gamma \in \mathbb {N}\) and \(\beta >0.\) The function \(f_\gamma \) thus has arbitrarily large Hölder smoothness if it is a polynomial. Conversely, one can easily check that \(f_\gamma \in \mathcal {F}^\beta \) if and only if \(\beta \le \gamma .\) We next show that the Hölder cones \(\mathcal {F}^\beta \) are nested.

Theorem 3

Let \(0<\beta '\le \beta \). Then

  1. (i)

    \(\mathcal {F}^\beta \subset \mathcal {F}^{\beta '}\) and \(|f|_{\mathcal {F}^{\beta '}}\le |f|_{\mathcal {F}^\beta } \vee \Vert f\Vert _{\infty },\)

  2. (ii)

    if \(f\in \mathcal {F}^\beta \) for \(\beta >1\) and \(\inf _x f(x) =0,\) then \(|f|_{\mathcal {F}^\beta }\ge \Vert f\Vert _{\infty }\) and \(|f|_{\mathcal {F}^{\beta '}}\le |f|_{\mathcal {F}^\beta },\)

  3. (iii)

    if \(f\ge c >0,\) then \(f\in \mathcal {F}^\beta \) if and only if \(f\in C^\beta .\)

Proof

  1. (i)

    It is enough to show that \(|f|_{\mathcal {F}^{\beta '}}\le |f|_{\mathcal {F}^\beta } \vee \Vert f\Vert _{\infty }.\) For \(1\le j<\beta ,\) define \(I_j^{\beta }:=\sup _{x\in [0,1]}|f^{(j)}(x)|^\frac{\beta }{j}/f(x)^\frac{\beta -j}{j}\) and \(x_j^* \in {\mathop {\hbox {argmax}}}_{x\in [0,1]} |f^{(j)}(x)|^{\beta '}/f(x)^{\beta '-j}.\) Assume \(f(x_j^*)>0\) since if \(f(x_j^*)=0\), then the result holds trivially as \(|f^{(j)}(x_j^*)|=0\). If \(|f^{(j)}(x_j^*)|\le f(x_j^*),\) then \(I_j^{\beta '} \le \Vert f\Vert _{\infty }.\) On the contrary, if \(|f^{(j)}(x_j^*)|\ge f(x_j^*),\) then \(I_j^{\beta '} \le I_j^{\beta }.\) This proves \(\mathrm{(i)}.\)

  2. (ii)

    By hypothesis, there exists \(x_0\in [0,1]\) such that \(f(x_0)=0\). By (2.1), this implies that \(f^{(j)}(x_0)=0\) for all \(1\le j<\beta ,\) so that \(\Vert f^{(j-1)}\Vert _\infty \le \Vert f^{(j)}\Vert _\infty .\) This already implies \(\mathrm{(ii)},\) since for \(\beta >1,\)

    $$\begin{aligned} |f|_{\mathcal {F}^\beta } \ge \max _{1\le j\le \lfloor \beta \rfloor } \left( \frac{\Vert f^{(j)}\Vert _\infty ^\beta }{\Vert f\Vert _\infty ^{\beta -j}}\right) ^{\frac{1}{j}}\ge \Vert f\Vert _\infty . \end{aligned}$$
  3. (iii)

    Under the assumptions, \(|f|_{\mathcal {F}^\beta } \le \max _{1\le j<\beta }(c^{j-\beta } \Vert f^{(j)}\Vert _\infty ^\beta )^{1/j}<\infty \). \(\square \)

Part (i) of the previous theorem shows that there are two regimes. If the flatness seminorm dominates the \(L^\infty \)-norm, then \(|\cdot |_{\mathcal {F}^\beta }\) increases in \(\beta ,\) which matches our intuition that the seminorms should become stronger for larger \(\beta .\) For functions that are very flat in the sense that \(|f|_{\mathcal {F}^\beta }\le \Vert f\Vert _{\infty },\) this need not be true. Consider the function \(f(x)=x+q\) on [0, 1] with \(q>0.\) For \(\beta >1,\) \(|f|_{\mathcal {F}^\beta }= q^{1-\beta }\) and thus for functions of low flatness, the seminorm can also decrease in \(\beta .\) The full flatness norms \(\Vert \cdot \Vert _{\mathcal {F}^\beta }\) are, however, unaffected by this since they also involve \(\Vert f\Vert _\infty .\) Statement \(\mathrm{(ii)}\) of the previous theorem says that the low-flatness phenomenon only occurs if the function is bounded away from zero, and this case \(\mathrm{(iii)}\) shows that the flatness seminorm is always finite.

For \(\beta \in (0,2]\), the additional derivative constraint is in fact always satisfied and \(\mathcal {F}^\beta \) contains all nonnegative functions that can be extended to a \(\beta \)-Hölder function on \(\mathbb {R}\).

Theorem 4

Suppose that \(f\in C^\beta (\mathbb {R})\) for \(\beta \in (0,2]\) and \(f\ge 0.\) Let \(f^*\) be the restriction of f to [0, 1]. Then, \(f^*\in \mathcal {F}^\beta \) and

$$\begin{aligned} |f^*|_{\mathcal {F}^\beta } \le 2^\beta |f|_{C^\beta (\mathbb {R})}. \end{aligned}$$

Proof

The result is trivial for \(\beta \le 1\), so assume \(\beta \in (1,2].\) Without loss of generality, we may assume that \(|f|_{C^\beta (\mathbb {R})}=1.\) Suppose that the statement does not hold, that is \(|f^*|_{\mathcal {F}^\beta }> 2^\beta .\) Then there exists \(x\in [0,1]\) such that \(|f'(x)|>2f(x)^{(\beta -1)/\beta }.\) By symmetry it is enough to consider \(f'(x)>2f(x)^{(\beta -1)/\beta }\) and we must also have \(f(x)>0\), since otherwise \(f'(x)=0\) as \(f\in C^\beta (\mathbb {R})\). Set \(t= x- f(x)^{1/\beta }\) and observe that for any \(y\in [t,x],\)

$$\begin{aligned} f'(y)\ge f'(x)- |f'(x)-f'(y)|\ge 2f(x)^{(\beta -1)/\beta } - |x-t|^{\beta -1} \ge f(x)^{(\beta -1)/\beta }. \end{aligned}$$

Consequently,

$$\begin{aligned} f(t) = f(x) -\int _t^x f'(s) \mathrm{d}s \le f(x) - f(x)^{(\beta -1)/\beta } |x-t| = 0. \end{aligned}$$

Since \(f\ge 0\) we must have \(f(t)=0\) but then also \(f'(t)=0\ge f(x)^{(\beta -1)/\beta } >0\), which is a contradiction. This proves the claim. \(\square \)

Up to smoothness \(\beta =2,\) flatness can thus be defined directly without the need to resort to a quantity such as (2.1). This was used in the recent statistical literature, see [20]. To further illustrate the previous result, consider the linear function \(f_1(x)=x.\) While \(f_1\in C^\beta ([0,1])\) for any \(\beta >0\), it has regularity one with respect to the Hölder cones. Indeed, \(f_1\) cannot be extended to a nonnegative function on \(\mathbb {R}\) that is smoother than Lipschitz since the non-negativity constraint induces a kink at zero.

The previous theorem cannot be extended to smoothness indices above \(\beta =2.\) Indeed, the function \(f(x)=(x-1/2)^2\) is at most in \(\mathcal {F}^2 ([0,1]),\) but can be extended to a function in \(C^\beta (\mathbb {R})\) for any \(\beta >0\). The reason is that for smoothness \(\beta \in (1,2],\) any violation of (2.1) must occur at the boundary, since the smoothness and non-negativity constraints together imply (2.1) for interior points. For smoothness indices \(\beta >2,\) the given example shows that there can be points in the interior of [0, 1] for which (2.1) does not hold.

If \(f\in \mathcal {F}^\beta \) then \(f'\) is not necessarily in \(\mathcal {F}^{\beta -1}\), but the following theorem shows that integration viewed as an operator from \(\mathcal {F}^\beta \) to \(\mathcal {F}^{\beta +1}\) is bounded.

Theorem 5

Given \(f\in \mathcal {F}^\beta ,\) write \(F:[0,1]\rightarrow [0,\infty ),\) \(F(x) =\int _0^x f(u) \mathrm{d}u\) for the antiderivative. Then there exists a constant \(C(\beta )\) such that

$$\begin{aligned} \Vert F\Vert _{\mathcal {F}^{\beta +1}} \le C(\beta ) \Vert f\Vert _{\mathcal {F}^\beta }. \end{aligned}$$

In particular, \(F\in \mathcal {F}^{\beta +1}.\)

Proof

It is sufficient to prove the result for \(\Vert f\Vert _{\mathcal {F}^\beta }=1.\) Observe that the Hölder seminorms agree, that is, \(|F|_{C^{\beta +1}} = |f|_{C^\beta }.\) It remains to show that for some \(\kappa \) which depends only on \(\beta ,\) \(|F^{(j)}(x)|\le \kappa ^{\frac{j}{\beta +1}} F(x)^{\frac{\beta +1-j}{\beta +1}},\) for all \(x\in [0,1]\) and all \(j=0,\ldots , \lfloor \beta \rfloor +1.\) Notice that by Lemma 1, \(f(x+h)\ge f(x)/2\) for all \(|h|\le a f(x)^{1/\beta },\) for some positive constant \(a=a(\beta ).\) We firstly prove the case \(j=1.\) We show that

$$\begin{aligned} f(x)^{\frac{\beta +1}{\beta }} \le \tfrac{2}{a} F(x). \end{aligned}$$
(2.3)

If \(af(x)^{1/\beta }\le x,\) then \(f(x)^{\frac{\beta +1}{\beta }}\le \tfrac{x}{a} f(x) \le \tfrac{2}{a}\int _0^x f(u) \mathrm{d}u = \tfrac{2}{a} F(x).\) On the other hand if \(af(x)^{1/\beta }\ge x,\) then \(f(x)^{\frac{\beta +1}{\beta }} \le \tfrac{2}{a} \int _{x-af(x)^{1/\beta }}^x f(u) \mathrm{d}u \le \tfrac{2}{a} F(x).\) This proves (2.3).

Recall that \(\Vert f\Vert _{\mathcal {F}^\beta }=1.\) Together with (2.3), for \(j=2, \ldots , \lfloor \beta \rfloor +1,\) \(|F^{(j)}(x)| =|f^{(j-1)}(x)| \le |f(x)|^{\frac{\beta +1-j}{\beta }}\le (2/a)^{\frac{\beta }{\beta +1}} F(x)^{\frac{\beta +1-j}{\beta +1}}.\) This proves the differential inequalities for \(j=2, \ldots , \lfloor \beta \rfloor +1.\) With (2.3), the statement follows. \(\square \)

2.3 Wavelet bounds for \(f^\alpha \)

Let \(\phi \), \(\psi \) be bounded, compactly supported S-regular scaling function and wavelet, respectively, where \(S\in \mathbb {N}\); in particular we assume that \(\int x^i \psi (x) dx = 0\) for \(i = 0,\ldots ,S-1\) (see, e.g., [12] for more details). Let \(\{\phi _k: k\in \mathbb {Z}\} \cup \{\psi _{j,k} : j = 0,1,\ldots , k \in \mathbb {Z}\}\) be the corresponding compactly supported wavelet basis of \(L^2(\mathbb {R})\). From this we can construct an S-regular wavelet basis of \(L^2([0,1])\) by selecting all basis functions with support intersecting the interval [0, 1] and then correcting for boundary effects as explained in Theorem 4.4 of [4]. There are thus sets of indices \(I_j\) with cardinality bounded by a multiple of \(2^j\) such that \(\{\phi _k: k\in I_{-1}\} \cup \{\psi _{j,k} : j = 0,1,\ldots , k \in I_j\}\) forms an orthonormal basis of \(L^2([0,1]).\)

Recall that \(f^\alpha \in C^\beta \) if \(f\in C^\beta \) is bounded away from zero, rather than \(f^\alpha \in C^{\alpha \beta }\) in general. We obtain two bounds on the wavelet coefficients \(|\langle f^\alpha , \psi _{j,k}\rangle |\) that reflect these two regimes. The first holds for all (jk) and is most useful for those \(\psi _{j,k}\) on whose support the function f is small. The second bound depends explicitly on the local function values and becomes useful when f is large, in which case \(f^\alpha \) typically has more regularity than \(\alpha \beta \) by Lemma 2.

Proposition 1

Suppose that \(\alpha \in (0,1]\), \(\psi \) is S-regular and that \(f\in \mathcal {F}^\beta \) for \(0< \beta < S\). Then

$$\begin{aligned} |\langle f^\alpha , \psi _{j,k} \rangle | \le C(\psi ,\alpha ,\beta ) \Vert f\Vert _{\mathcal {F}^\beta }^\alpha \, 2^{-\frac{j}{2} (2\alpha \beta +1)} . \end{aligned}$$

For \(x_0\in [0,1]\), let \(j(x_0)\) be the smallest integer satisfying

$$\begin{aligned} 2^{j(x_0)}\ge |{\text {supp}}(\psi )|a^{-1} (\Vert f\Vert _{\mathcal {F}^\beta }/f(x_0))^{1/\beta }, \end{aligned}$$

where \(a=a(\beta )\) is the constant in Lemma 1. Then for any wavelet \(\psi _{j,k}\) with \(j\ge j(x_0)\) and \(x_0\in {\text {supp}}(\psi _{j,k})\),

$$\begin{aligned} |\langle f^\alpha , \psi _{j,k} \rangle | \le C(\psi ,\alpha ,\beta )\frac{ \Vert f\Vert _{\mathcal {F}^\beta } }{f(x_0)^{1-\alpha }} 2^{-\frac{j}{2}(2\beta +1)}. \end{aligned}$$

The decay of the wavelet coefficients \(|\langle f^\alpha , \psi _{j,k}\rangle |\) characterizes the Besov norms \(\Vert f^\alpha \Vert _{B_{pq}^\beta }\) (see Chapter 4 of [10] for a full definition), and one could use Proposition 1 to prove Theorem 1. However, while \(B_{\infty \infty }^\beta = C^\beta \) for non-integer \(\beta \), \(B_{\infty \infty }^\beta \) equals the slightly larger Hölder-Zygmund space for integer \(\beta \), resulting in a slight suboptimality when \(\alpha \beta \in \mathbb {N}\). While Theorem 1 provides a more concise statement, the extra local information provided by the above wavelet bounds can be crucial to obtain sharp results, for example in certain nonlinear statistical inverse problems [22]. Recall that \(f\in C^\gamma \) if and only if \(|\langle f,\psi _{j,k} \rangle | \lesssim 2^{-\frac{j}{2} (2\gamma +1)}\) for all (jk) (with a slight correction for \(\gamma \in \mathbb {N}\)). From Proposition 1 we can therefore conclude that on low resolution levels, that is if j is small, the wavelet coefficients have the decay of a \(C^{\alpha \beta }\)-function. The result also shows that on high frequencies the function is \(C^\beta \) and quantifies the resolution level, which depends on the function value, at which the transition between these two cases occurs.

3 Proof of Theorem 1

The proof is based on two technical lemmas. The first lemma identifies a local neighbourhood of each point on which the function is relatively constant.

Lemma 1

Suppose that \(f \in \mathcal {F}^\beta \) with \(\beta >0\) and let \(a = a(\beta )>0\) be any constant satisfying \((e^a-1) + a^\beta / (\lfloor \beta \rfloor !) \le 1/2.\) Then for

$$\begin{aligned} |h| \le a \left( \frac{|f(x)|}{\Vert f\Vert _{\mathcal {F}^\beta }}\right) ^{1/\beta }, \end{aligned}$$

we have

$$\begin{aligned} |f(x+h) - f(x)| \le \frac{1}{2} |f(x)| , \end{aligned}$$

implying in particular, \(|f(x)|/2 \le |f(x+h)| \le 3|f(x)|/2\).

Proof

Without loss of generality, we may assume that \(\Vert f\Vert _{\mathcal {F}^\beta }=1.\) By a Taylor expansion and the definition of \(\mathcal {F}^\beta \), there exists \(\xi \) between x and \(x+h\) such that

$$\begin{aligned} |f(x+h) - f(x)|&= \left| \sum _{j=1}^{\lfloor \beta \rfloor -1} \frac{f^{(j)}(x) h^j}{j!} + \frac{1}{\lfloor \beta \rfloor !} f^{(\lfloor \beta \rfloor )}(\xi )h^{\lfloor \beta \rfloor } \right| \\&\le \sum _{j=1}^{\lfloor \beta \rfloor } \frac{|f(x)|^{\frac{\beta -j}{\beta }} h^j}{j!} + \frac{1}{\lfloor \beta \rfloor !} \left| f^{(\lfloor \beta \rfloor )}(\xi ) - f^{(\lfloor \beta \rfloor )}(x) \right| h^{\lfloor \beta \rfloor } \\&\le \sum _{j=1}^{\lfloor \beta \rfloor } \frac{a^j }{j!} |f(x)| + \frac{a^\beta }{\lfloor \beta \rfloor !} |f(x)| \le \frac{1}{2}|f(x)| . \end{aligned}$$

\(\square \)

For \(f\in \mathcal {F}^\beta \), the function \(f^\alpha \) satisfies a Hölder-type condition with exponent \(\beta \) and locally varying Hölder constant. The following is the key technical result for establishing the smoothness of \(f^\alpha \) and hence the decay of \(|\langle f^\alpha , \psi _{j,k} \rangle |\). The main ingredient in the proof is a careful analysis of Faà di Bruno’s formula, which generalizes the chain rule to higher derivatives [8]:

$$\begin{aligned} \frac{\mathrm{d}^k}{\mathrm{d}x^k} h(f(x)) = \sum _{(m_1,\ldots ,m_k) \in \mathcal {M}_k} \frac{k!}{m_1! \ldots m_k!} h^{\left( \sum _j m_j \right) }(f(x)) \prod _{j=1}^k \left( \frac{f^{(j)}(x)}{j!} \right) ^{m_j} , \end{aligned}$$
(3.1)

where \(\mathcal {M}_k\) is the set of all k-tuples of nonnegative integers satisfying \(\sum _{j=1}^k j m_j = k\). Note that for \(h(x) = x^\alpha \), we have \(h^{(r)}(x) = C_{\alpha ,r} x^{\alpha -r}\) for some \(C_{\alpha ,r} \ne 0\) (except the trivial case \(\alpha =1\)). We can relate the derivatives appearing in (3.1) to f using the seminorm \(|\cdot |_{\mathcal {F}^\beta }\).

Lemma 2

For \(\alpha \in (0,1]\), \(\beta >0\), there exists a constant \(C(\alpha ,\beta )\) such that for all \(f \in \mathcal {F}^\beta \), \(0\le k <\beta \) and \(x,y\in [0,1]\),

$$\begin{aligned} |(f(x)^\alpha )^{( \lfloor \beta \rfloor )} - (f(y)^\alpha )^{(\lfloor \beta \rfloor )} | \le \frac{C(\alpha ,\beta ) (|f|_{C^\beta } + |f|_{\mathcal {F}^\beta }) }{\min (f(x)^{1-\alpha }, f(y)^{1-\alpha })} |x-y|^{\beta - \lfloor \beta \rfloor }, \end{aligned}$$

and

$$\begin{aligned} \left| \frac{\mathrm{d}^k}{\mathrm{d}x^k} \left( f(x)^\alpha \right) \right| \le C(\alpha ,\beta ) \Vert f\Vert _{\mathcal {F}^\beta }^{k/\beta } f(x)^{\alpha -k/\beta }. \end{aligned}$$
(3.2)

Moreover, if \(f \ge \varepsilon > 0\),

$$\begin{aligned} \Vert f^\alpha \Vert _{\mathcal {F}^\beta } \le \frac{C(\alpha ,\beta )}{\varepsilon ^{1-\alpha }} \Vert f\Vert _{\mathcal {F}^\beta }. \end{aligned}$$

Proof

Without loss of generality assume \(f(y) \le f(x)\) and \(|f|_{C^\beta } + |f|_{\mathcal {F}^\beta }=1\). For \(\beta \in (0,1]\), by the mean value theorem,

$$\begin{aligned} |f(x)^\alpha - f(y)^\alpha | \le \sup _{f(y) \le t \le f(x)} \alpha t^{\alpha -1} |f(x) - f(y)| \le \frac{|x-y|^\beta }{f(y)^{1-\alpha }}. \end{aligned}$$

Consider now \(\beta > 1\) and write \(k = \lfloor \beta \rfloor \) (the following also holds for all \(k\le \lfloor \beta \rfloor \) with certain simplifications). We must consider separately the two cases where \(|x-y|\) is small and large. Let \(C(\alpha ,\beta )\) be a generic constant, which may change from line to line.

Suppose first that \(|x-y| \le a f(x)^{1/\beta }\) with a as in Lemma 1. By Lemma 1 we have \(f(y)/2 \le f(x) \le 3f(y)/2\), which will be used freely without mention in the following. The proof is based on a careful analysis of Faà di Bruno’s formula (3.1).

We shall establish the result by proving a Hölder bound for each of the summands in (3.1) individually. Fix a k-tuple \((m_1,\ldots ,m_k) \in \mathcal {M}_k\) and write \(M:=\sum _{j=1}^k m_j.\) By the triangle inequality

$$\begin{aligned} \begin{aligned}&\left| f(x)^{\alpha - M} \prod _{j=1}^k \left( f^{(j)}(x) \right) ^{m_j} - f(y)^{\alpha -M} \prod _{j=1}^k \left( f^{(j)}(y) \right) ^{m_j} \right| \\&\le \left| \left( f(x)^{\alpha -M} - f(y)^{\alpha - M} \right) \prod _{j=1}^k \left( f^{(j)}(x) \right) ^{m_j} \right| \\&\quad \quad + \left| f(y)^{\alpha -M} \left( \prod _{j=1}^k \left( f^{(j)}(x) \right) ^{m_j} - \prod _{j=1}^k \left( f^{(j)}(y) \right) ^{m_j} \right) \right| . \end{aligned} \end{aligned}$$
(3.3)

Before bounding the terms in (3.3), we require some additional estimates. Firstly, by the definition of \(\mathcal {F}^\beta \),

$$\begin{aligned} \left| \prod _{j=1}^k (f^{(j)}(x))^{m_j} \right| \le \prod _{j=1}^k |f(x)|^\frac{(\beta -j)m_j}{\beta } = |f(x)|^{M - k/\beta } . \end{aligned}$$
(3.4)

Secondly, for any function g and integer \(r \ge 1\), we have by the mean value theorem \(g(x)^r - g(y)^r = r g(\xi )^{r-1} g'(\xi ) (x-y)\) for some \(\xi \) between x and y. Noting that \(f(\xi ) \approx f(x) \approx f(y)\), that \(\beta - k \in (0,1]\) and applying the above to \(g(x) = f^{(j^*)}(x)\) with \(r = m_{j^*}\) and \(j^* \in \{ 1,\ldots ,k-1\}\) yields

$$\begin{aligned} \begin{aligned}&\left| \left( f^{(j^*)} (x) \right) ^{m_{j^*}} - \left( f^{(j^*)} (y) \right) ^{m_{j^*}} \right| \\&\le \left| m_{j^*} \left( f^{(j^*)} (\xi _{x,y,j^*}) \right) ^{m_{j^*}-1} f^{(j^*+1)} (\xi _{x,y,j^*}) (x-y) \right| ^{\beta -k}\\&\quad \times \left| \left( f^{(j^*)} (x) \right) ^{m_{j^*}} + \left( f^{(j^*)} (y) \right) ^{m_{j^*}} \right| ^{1-(\beta -k)} \\&\le C(\beta ) m_{j^*}^{\beta -k} |f(\xi _{x,y,j^*}) |^{\frac{(m_{j^*} (\beta -j^*)-1)(\beta -k)}{\beta }} |x-y|^{\beta -k} |f(y)|^{\frac{m_{j^*}(\beta -j^*)(1-\beta +k)}{\beta }} \\&\le C(\beta ) |f(y)|^{\frac{m_{j^*}(\beta -j^*) - (\beta -k)}{\beta }} |x-y|^{\beta -k} . \end{aligned} \end{aligned}$$
(3.5)

Strictly speaking, we cannot invoke (3.5) for \(j^* = k\), since f is only k-times differentiable. However, noting that \(m_k\) can only take values 0 or 1, we see directly from the Hölder continuity of \(f^{(k)}\) that the conclusion of (3.5) holds as well for \(j^* =k\) since we have the bound \(|x-y|^{\beta -k}\). By the same argument as (3.5)

$$\begin{aligned} \begin{aligned} |f(x)^{\alpha - M} - f(y)^{\alpha - M} |&\le C(\alpha ,\beta ) \frac{ |f(y)|^{M - 1-\alpha + k/\beta } |x-y|^{\beta -k} }{ f(x)^{ M -\alpha } f(y)^{ M - \alpha } } \\&\le C(\alpha ,\beta ) |f(y)|^{-M- 1+\alpha + k/\beta } |x-y|^{\beta -k} . \end{aligned} \end{aligned}$$
(3.6)

Using (3.4) and (3.6), the first term in (3.3) is bounded by \(C(\beta ) |f(y)|^{\alpha -1} |x-y|^{\beta -k}\) as required.

For the second term in (3.3), we repeatedly apply the triangle inequality, each time changing the variable in a single derivative. Fix \(j^*\) and define vectors \( (z_j^{j^*} )_{j=1}^k\), \((\tilde{z}_j^{j^*})_{j=1}^k\) that are identically equal to x or y in all entries and differ only in the \(j^*\)-coordinate, where \(z_{j^*}^{j^*} = x\), \(\tilde{z}_{j^*}^{j^*} = y\). Using (3.5)

$$\begin{aligned}&f(y)^{\alpha - M } \left| \prod _{j=1}^k \left( f^{(j)} (z_j^{j^*} ) \right) ^{m_j} - \prod _{j=1}^k \left( f^{(j)} (\tilde{z}_j^{j^*}) \right) ^{m_j} \right| \nonumber \\&= f(y)^{\alpha - M } \prod _{j=1, j\ne j^*}^k \left( f^{(j)} (z_j^{j^*} ) \right) ^{m_j} \left| \left( f^{(j^*)} (x) \right) ^{m_{j^*}} - \left( f^{(j^*)} (y)\right) ^{m_{j^*}} \right| \nonumber \\&\le C(\alpha ,\beta ) f(y)^{\alpha - M} \left( \prod _{j=1,j\ne j^*}^k \left| f (z_j^{j^*} ) \right| ^\frac{(\beta -j)m_j}{\beta } \right) |f(y)|^{\frac{m_{j^*}(\beta -j^*) - (\beta -k)}{\beta }} |x-y|^{\beta -k} \nonumber \\&\le C(\alpha ,\beta ) |f(y)|^{\alpha - M + \sum _j (\beta -j)m_j/\beta - (\beta -k)/\beta } |x-y|^{\beta -k} \nonumber \\&= C(\alpha ,\beta ) |f(y)|^{\alpha -1} |x-y|^{\beta -k} , \end{aligned}$$
(3.7)

where in the last line we have used that \(\sum _j j m_j = k\). By repeatedly applying the triangle inequality and using (3.7), we can bound the second term in (3.3) by

$$\begin{aligned}&f(y)^{\alpha - M } \sum _{j^*=1}^k \left| \prod _{j=1}^k \left( f^{(j)} (z_j^{j^*}) \right) ^{m_j} - \prod _{j=1}^k \left( f^{(j)} (\tilde{z}_j^{j^*}) \right) ^{m_j} \right| \\&\quad \quad \le C(\alpha ,\beta ) |f(y)|^{\alpha -1} |x-y|^{\beta -k} , \end{aligned}$$

thereby completing the proof in the case \(|x-y| \le a f(x)^{1/\beta }\).

Applying Faà di Bruno’s formula (3.1) and (3.4) yields

$$\begin{aligned} \left| \frac{\mathrm{d}^k}{\mathrm{d}x^k} \left( f(x)^\alpha \right) \right| \le \sum _{(m_1,\ldots ,m_k) \in \mathcal {M}_k} C f(x)^{\alpha -M} f(x)^{M - k/\beta } \le C f(x)^{\alpha -k/\beta }. \end{aligned}$$
(3.8)

For \(|x-y| > a f(x)^{1/\beta }\) we thus have

$$\begin{aligned} \left| \left( f(x)^\alpha \right) ^{(k)} - \left( f(y)^\alpha \right) ^{(k)} \right|&\le C(\alpha ,\beta ) \left( f(x)^{\alpha -k/\beta } + f(y)^{\alpha -k/\beta } \right) \\&\le \frac{C(\alpha ,\beta )}{f(y)^{1-\alpha }} f(x)^\frac{\beta -k}{\beta } \le \frac{C(\alpha ,\beta )}{f(y)^{1-\alpha }} |x-y|^{\beta -k}, \end{aligned}$$

as required. This completes the proof for the first statement. Note that (3.2) follows directly from (3.8), since this last expression also holds for all \(0\le k<\beta \).

Suppose now \(f \ge \varepsilon > 0\). It follows immediately from the results that have just been established that \(\Vert f^\alpha \Vert _{C^\beta } \le C \Vert f\Vert _{\mathcal {F}^\beta }/\varepsilon ^{1-\alpha }\). By (3.2),

$$\begin{aligned} \left| \left( f(x)^\alpha \right) ^{(j)} \right| \le C\Vert f\Vert _{\mathcal {F}^\beta }^{j/\beta } f(x)^{\alpha -j/\beta } \left( f(x)/\varepsilon \right) ^{\frac{j(1-\alpha )}{\beta }} = C \left( \Vert f\Vert _{\mathcal {F}^\beta }/\varepsilon ^{1-\alpha } \right) ^\frac{j}{\beta } \left( f(x)^\alpha \right) ^{\frac{\beta -j}{\beta }} \end{aligned}$$

for all \(1\le j<\beta \), so that also \(|f^\alpha |_{\mathcal {F}^\beta } \le C\Vert f\Vert _{\mathcal {F}^\beta }/\varepsilon ^{1-\alpha }\). \(\square \)

Proof of Theorem 1

By rescaling, we may assume that \(\Vert f\Vert _{\mathcal {F}^\beta }=1\). Throughout the proof, C denotes a generic constant which only depends on \(\alpha \) and \(\beta \) and may change from line to line. It is enough to prove \(\Vert f^\alpha \Vert _{\mathcal {F}^{\alpha \beta }} \le C.\) With (3.2), we find that \(\Vert f^\alpha \Vert _\infty + \Vert (f^\alpha )^{(\lfloor \alpha \beta \rfloor )}\Vert _\infty + | f^\alpha |_{\mathcal {F}^{\alpha \beta }} \le C.\) It thus remains to establish \(|f^\alpha |_{C^{\alpha \beta }} \le C.\)

Write \(\alpha \beta = \ell + \sigma \) with \(\ell = \lfloor \alpha \beta \rfloor \) and \(0<\sigma \le 1.\) Let \(x,y \in [0,1]\) with \(x< y\) and define \(\Delta _{\alpha , \ell }(x,y) := |( f(x)^\alpha )^{(\ell )} - ( f(y)^\alpha )^{(\ell )}|.\) By (3.2),

$$\begin{aligned} \Big | \big ( f(x)^\alpha \big )^{(\ell )} \Big | \le C f(x)^{\alpha -\ell /\beta }. \end{aligned}$$
(3.9)

If \(|y-x| \ge a (f(x)\vee f(y))^{1/\beta },\) we find \(\Delta _{\alpha , \ell }(x,y) \le C |x-y|^\sigma .\)

On the contrary, if \(|y-x| \le a(f(x)\vee f(y))^{1/\beta },\) we conclude from Lemma 1 that \(f(x)/2 \le f(z) \le 2f(x)\) for all \(x\le z \le y.\) Consider first the case \(\ell < \lfloor \beta \rfloor .\) With (3.2),

$$\begin{aligned} \Delta _{\alpha , \ell }(x,y) \le \sup _{z\in [x,y]} \Big |\big ( f(z)^\alpha \big )^{(\ell +1)}\Big | |y-x| \le C f(x)^{\alpha -(\ell +1)/\beta } |y-x|. \end{aligned}$$
(3.10)

Write \(\Delta _{\alpha , \ell }(x,y) =\Delta _{\alpha , \ell }(x,y)^{1-\sigma } \Delta _{\alpha , \ell }(x,y)^\sigma \) and bound the first factor using (3.9) and the second factor using (3.10). This shows that \(\Delta _{\alpha , \ell }(x,y)\le C|x-y|^\sigma .\)

Consider finally the case \(\ell = \lfloor \beta \rfloor .\) Let \(\theta = \sigma /(\beta -\ell )\) and observe that \(\Delta _{\alpha , \ell }(x,y) =\Delta _{\alpha , \ell }(x,y)^{1-\theta } \Delta _{\alpha , \ell }(x,y)^\theta \le C |y-x|^\sigma \), bounding the first factor using (3.9) and the second factor using the first assertion of Lemma 2. \(\square \)

Proof of Proposition 1

We may again assume that \(\Vert f\Vert _{\mathcal {F}^\beta }=1\) for both statements. If the wavelet function is S-regular and \(f\in C^\beta ([0,1])\) for \(0< \beta <S,\) then there exists a function g with \(\Vert g\Vert _\infty \le 1,\) such that for any \(x_0\in (0,1),\) we can bound \(| \int f(x) \psi _{j,k}(x) \mathrm{d}x |\) by

$$\begin{aligned} \frac{1}{\lfloor \beta \rfloor !} \left| \int \big [ f^{(\lfloor \beta \rfloor )}(x_0+g(y) (y-x_0))- f^{(\lfloor \beta \rfloor )}(x_0) \big ] (y-x_0)^{\lfloor \beta \rfloor } \psi _{j,k} (y) \mathrm{d}y \right| . \end{aligned}$$

This wavelet bound can be easily proved by Taylor expanding f and using the moment properties of the wavelet function.

For \(\eta >0\), note that \(|f+\eta |_{C^\beta } = |f|_{C^\beta }\) and \(|f+\eta |_{\mathcal {F}^\beta } \le |f|_{\mathcal {F}^\beta }\), since adding a positive constant function to \(f\ge 0\) can only reduce the \(|\cdot |_{\mathcal {F}^\beta }\)-seminorm. Applying Lemma 2 thus yields \(|(f + \eta )^\alpha |_{C^\beta } \le C/\eta ^{1-\alpha }\). Using that \(|(x + \eta )^\alpha - x^\alpha | \le \eta ^\alpha \) for all \(x \ge 0\) and the wavelet bound for \(C^\beta \)-functions, we have

$$\begin{aligned} |\langle f^\alpha , \psi _{j,k} \rangle |&\le \left| \int (f + \eta )^\alpha \psi _{j,k} \right| + \left| \int ((f + \eta )^\alpha - f^\alpha ) \psi _{j,k} \right| \\&\le \frac{C}{\eta ^{1-\alpha }} 2^{-\frac{j}{2}(2\beta +1)} + C'\eta ^\alpha 2^{-j/2} . \end{aligned}$$

Optimizing over \(\eta \) yields \(\eta = 2^{-j\beta }\) and thence the first result.

We show that for all \(j \ge j(x_0)\), the support of any \(\psi _{j,k}\) with \(\psi _{j,k}(x_0) \ne 0\) is contained in the set \(\{ x : f(x_0) /2 \le f(x) \}\). To see this observe that for any such \(\psi _{j,k}\) it holds that \(|{\text {supp}}(\psi _{j,k})| \le 2^{-j}|{\text {supp}}(\psi )| \le a f(x_0)^{1/\beta }\) and so applying Lemma 1 yields that \(f(t) \ge f(x_0) - |f(t) - f(x_0)| \ge f(x_0)/2\) for any \(t \in {\text {supp}}(\psi _{j,k})\). Using this and applying Lemma 2, we have that \(\Vert f^\alpha \Vert _{\mathcal {F}^\beta ({\text {supp}}(\psi _{j,k}))} \le C(\beta )/(f(x_0)/2)^{1-\alpha }\), where the first norm refers to \(f^\alpha \) restricted to \({\text {supp}}(\psi _{j,k})\) with the obvious modification of \(\Vert \cdot \Vert _{\mathcal {F}^\beta }\) to this set. Applying the wavelet bound above to such (jk) then yields the result. \(\square \)