Abstract
Using wavelets methods, Abbaszadeh, Chesneau, Doosti studied the density estimation problem under bias and multiplicative censoring (Stat. Probab. Lett. 82:932-941, 2012), and obtains the convergence rate of wavelet estimators in \(L^{2}\) norm for a density function in Besov space. This paper deals with \(L^{p}\) risk estimation with \(1\leq p<\infty\) based on wavelet bases. Motivated by the work of Youming and Junlian (2014), we construct new estimators: a linear one and a nonlinear adaptive one; an upper bound of wavelet estimators on \(L^{p}\) risk for a density function in Besov space is provided, which generalizes Abbaszadeh et al.’s theorems. It turns out that the nonlinear adaptive estimator obtains faster rate of convergence than the linear one for \(r< p\).
Similar content being viewed by others
1 Introduction and preliminary
1.1 Introductions
The density estimation plays important roles in both statistics and econometrics. This paper considers the density model under bias and multiplicative censoring, which were introduced by Abbaszadeh et al. [1]. Let \(Z_{1}, Z_{2}, \ldots, Z_{n}\) be independent and identically distributed (i.i.d.) random variables of
where \(U_{1}, U_{2}, \ldots, U_{n}\) are unobserved i.i.d. random variables with the common uniform distribution on \([0, 1]\), \(Y_{1}, Y_{2}, \ldots, Y_{n}\) are also unobserved i.i.d. random variables and the density function \(f_{Y}\) is given by
Here, \(\omega(x)>0\) denotes a known weight function, \(f_{X}(x)\) stands for an unknown density function of a random variable X and \(\theta=E(\omega(X))=\int_{0}^{1}\omega(x)f_{X}(x)\,dx\) represents the unknown normalization constant (EX is the expectation of X). We suppose that \(U_{i}\) and \(Y_{i}\) are independent for each \(i\in{1, 2, \ldots, n}\). Our aim is to estimate \(f_{X}\) when only \(Z_{1}, Z_{2}, \ldots, Z_{n}\) are observed.
In particular, when \(\omega(x)=1\), this model reduces to the classical density estimation problem under multiplicative censoring described by Vardi [2], which unifies several well-studied statistical problems, including non-parametric inference for renewal processes, certain non-parametric deconvolution problems and estimation of decreasing densities [2–4]. Many methods were proposed to deal with that problem including a series expansion method [4], the kernel method [5] and wavelet method [1, 6], etc. For the standard biased density model (1.1) (estimating \(f_{X}\) from \(Y_{1}, Y_{2}, \ldots, Y_{n}\)), we refer to [7–9]. However, the estimation of \(f_{X}\) from \(Z_{1}, Z_{2}, \ldots, Z_{n}\) is a new statistical problem which has potential applications in statistics and econometrics. As far as we know, only Abbaszadeh et al. [1] dealt with that problem. By using wavelet method, they considered a convergence rate of estimators in \(L^{2}\) norm over Besov space \(B^{s}_{r,q}\).
It is well known that in many statistical models, the error of estimators is measured in \(L^{p}\) norm [10–12]. In this paper, we consider \(L^{p}\) (\(1< p < \infty\)) risk estimation in Besov space \(B^{s}_{r,q}\) based on wavelet bases. We define a linear estimator and a nonlinear adaptive one motivated by Abbaszadeh et al. and Youming and Junlian’s work. We prove that the nonlinear adaptive estimator achieves a faster rate of convergence than the linear one for \(r< p\). Our results can be considered as an extension of Abbaszadeh et al.’s theorems from \(p=2\) to \(p\in [1, +\infty)\).
Section 1.2 introduces some notations and classical results on wavelets and Besov spaces, which will be used in our discussions; the assumptions on the model and the main results are presented in Section 2. In order to prove our theorems, we show several lemmas in Section 3 and give the proofs in Section 4.
1.2 Some preparations
In recent years, the wavelet method has turned out to be effective for density estimation [1, 6, 11, 12] because of the time and frequency localization, being a fast algorithm in numerical computations. In this subsection, we introduce the wavelet basis of the real line \(\mathbb{R}\) (not necessarily on the fixed interval \([0, 1]\) as in [1]), which will be used in our discussions. Let \(\varphi\in C_{0}^{m}(\mathbb{R})\) be an orthonormal scaling function with \(m>s\). The corresponding wavelet function is denoted by ψ. It is well known that \(\{ \phi_{J,k}, \psi_{j,k}, j\geq J, k\in\mathbb {Z}\}\) constitutes an orthonormal basis of \(L^{2}(\mathbb{R})\), where \(f_{j,k}(x):=2^{j/2}f(2^{j}x-k)\) as in wavelet analysis. Then, for each \(f\in L^{2}(\mathbb{R})\),
where \(\alpha_{J,k}=\int f(x)\phi_{j,k}(x)\,dx\) and \(\beta_{j,k}=\int f(x)\psi_{j,k}(x)\,dx\). Details on wavelet bases can be found in [13].
One of the advantages of wavelet bases is that they can characterize Besov spaces. Throughout the paper, we work within Besov space on a compact subset of the real line \(\mathbb{R}\) (not necessarily on the fixed interval \([0, 1]\) as [1]). To introduce those spaces, we need the well-known Sobolev spaces with integer exponents \(W_{p}^{n}(\mathbb {R}):=\{ f| f\in L^{p}(\mathbb{R}), f^{(n)}\in L^{p}(\mathbb{R})\}\) and \(\|f\|_{W_{p}^{n}}:=\|f\|_{p}+\|f^{(n)}\|_{p}\). Then \(L^{p}(\mathbb{R})\) can be considered as \(W^{0}_{p}(\mathbb{R})\). For \(1\leq p, q\leq\infty\) and \(s=n+\alpha\) with \(\alpha\in(0,1]\), the Besov spaces on \(\mathbb{R}\) are defined by
where \(\omega_{p}^{2}(f, t):= \sup_{|h|\leq t}\|f (\cdot+2h)-2f (\cdot+h)+f (\cdot)\|_{p}\) denotes the smoothness modulus of f and
The associated norm \(\|f\|_{B_{p, q}^{s}}:=\|f\|_{p}+\|t^{-\alpha}\omega _{p}^{2}(f^{(n)}, t)\|_{q}^{\ast}\). It should be pointed out that Besov spaces contain Hölder spaces and Sobolev spaces with non-integer exponents for a particular choice of s, p, and r [13].
The following theorems are fundamental in our discussions.
Theorem 1.1
([14])
Let \(f\in L^{r}(\mathbb {R})\) (\(1\leq r\leq\infty\)), \(\alpha_{J,k}=\int f(x)\phi_{J,k}(x)\,dx\), \(\beta_{j,k}=\int f(x)\psi_{j,k}(x)\,dx\). Then the following assertions are equivalent.
-
(i)
\(f\in B_{r,q}^{s}(\mathbb{R})\), \(s>0\), \(1\leq q\leq\infty\);
-
(ii)
\(\{2^{js}\|P_{j}f-f\|_{r}\}_{j\geq0}\in l^{q}\), where \(P_{j}(x):=\sum_{k\in\mathbb{ Z}}\alpha_{j,k}\phi_{J,k}(x)\) is the projection operator to \(V_{j}\);
-
(iii)
\(\|\alpha_{J,\cdot}\|_{r}+\|\{2^{j(s+1/2-1/r)}\|\beta_{j,\cdot}\| _{r}\}_{j\geq0}\|_{q}< \infty\).
Theorem 1.2
([14])
Let ϕ be a scaling function or a wavelet with \(\theta(\phi):=\sup_{x\in \mathbb{R}}|\phi(x-k)|< \infty\). Then
for \(\lambda=\{\lambda_{k}\}\in l^{p}(\mathbb{Z})\) and \(1\leq p\leq \infty\), where
Here and after, \(A\lesssim B\) denotes \(A\leq CB\) for some constant \(C>0\); \(A\sim B\) stands for both \(A\lesssim B\) and \(B\lesssim A\). Clearly, Daubechies and Meyer’s scaling and wavelet functions satisfy the conditions \(\theta(\phi)<\infty\).
2 Main results
This section is devoted to the statement of our main results. To do that, we make the following assumptions as described in [1]:
-
(A1)
The two density functions \(f_{X}\) and \(f_{Y}\) have the support \([0, 1]\) and \(f_{X}\) belongs to the Besov ball \(B_{r,q}^{s}(H)\) (\(H>0\)) defined as
$$B_{r,q}^{s}(H):=\bigl\{ f\in B_{r,q}^{s}( \mathbb{R}), f\mbox{ is a probability density and }\|f\|_{B_{r,q}^{s}}\leq H \bigr\} . $$ -
(A2)
The density of \(Z_{i}\) is
$$f_{Z}(x)=\int_{x}^{1} \frac{f_{Y}(y)}{y}\,dy. $$ -
(A3)
There exists a constant \(C>0\) such that
$$\sup_{x\in[0,1]} f_{X}(x)\leq C,\qquad \sup _{x\in[0,1]} f_{Z}(x)\leq C. $$ -
(A4)
There exist a constant \(C>0\) and \(c>0\) such that
$$\sup_{x\in[0,1]} \omega(x)\leq C, \qquad \sup_{x\in[0,1]} \omega'(x)\leq C, \qquad \inf_{x\in[0,1]} \omega(x)\geq c. $$
To introduce the wavelet estimator, we define the operator T by
for \(h\in C^{1}(\mathbb{R})\), the function set of all differential functions on \(\mathbb{R}\). Then the linear estimator is given as follows:
where \(j_{0}\) is chosen such that \(2^{j_{0}}\sim n^{\frac{1}{2s+3}}\) and \(\wedge:=\{k\in\mathbb{Z}, \operatorname{supp} f_{X} \cap \operatorname{supp} \phi _{j_{0}, k}\neq\varnothing\}\).
To obtain a nonlinear estimator, we take \(j_{0}\) and \(j_{1}\) such that \(2^{j_{1}}\sim\frac{n}{\ln n}\) and \(2^{j_{0}}\sim n^{\frac {1}{2m+3}}\) with \(m>s\). By definition
are the estimators of \(\alpha_{j, k}=\int f(x)\phi_{j, k}(x)\,dx\) and \(\beta_{j, k}=\int f(x)\psi_{j, k}(x)\,dx\), respectively, with
Then the nonlinear estimator is given by
where \(\wedge:=\{k\in\mathbb{Z}, \operatorname{supp} f_{X}\cap \operatorname{supp} \phi _{j_{0}, k}\neq\varnothing\}\), \(\wedge_{j}:=\{k\in\mathbb{Z}, \operatorname{supp} f_{X}\cap \operatorname{supp} \psi_{j, k}\neq\varnothing\}\) and \(1_{D}\) denotes the indicator function on the set D with \(\lambda:=T2^{j}\sqrt{\frac{\ln n}{n}}\).
Remark 2.1
From the definition of \(\hat {f}^{\mathrm{non}}_{n}\), we find that the nonlinear estimator has the advantage of being adaptive, since it does not depend on the indices s, r, q, and H in its construction.
Remark 2.2
The definitions of \(\hat {f}^{\mathrm{lin}}\) and \(\hat{f}^{\mathrm{non}}\) are essentially same as in [1]. However, the selection of \(j_{0}\) and \(j_{1}\) is different from that in [1] and the wavelet functions are defined on the real line \(\mathbb{R}\) not necessarily on \([0, 1]\).
Then we have the following approximation result, which extends Abbaszadeh et al.’s theorems [1] from \(p=2\) to \(p\in[1, +\infty)\).
Theorem 2.1
Let \(f_{X}(x)\in B_{r,q}^{s}(H)\) (\(s>\frac{1}{r}\), \(r,q\geq1\)) and \(\hat{f}^{\mathrm{lin}}\) be the estimator defined by (2.1). If (A1)-(A4) hold, then for each \(1\leq p<\infty\), \(s'=s-(1/r-1/p)_{+}\), and \(x_{+}:=\max(x,0)\),
Remark 2.3
The condition \(s>\frac{1}{p}\) seems natural, since \(B^{s}_{r,q}(\mathbb{R})\subseteq C(\mathbb{R})\) for \(sp>1\), where \(C(\mathbb{R})\) denotes the function set of all continuous functions on \(\mathbb{R}\).
Remark 2.4
If \(r\geq2\) and \(p=2\), then Abbaszadeh et al.’s Theorem 4.1 [1] follows directly from our theorem, in this case \(s'=s\). That is, Theorem 2.1 extends the corresponding theorem of [1] from \(p=2\) to \(p\in[1, +\infty)\).
Remark 2.5
When \(\omega(x)=1\) and \(\theta =E(\omega(X))=1\), the model reduces to the standard multiplicative censoring one considered by Abbaszadeh et al. [6]. In [6], they estimate the convergence rate of wavelet estimators in \(L_{p}\) norm for a density and its derivatives in Besov space. Our result is consistent with Theorem 4.1 [6] taken with \(m=0\).
Theorem 2.2
Let \(f_{X}(x)\in B_{r,q}^{s}(H)\) (\(\frac{1}{r} < s < m\), \(r, q\geq1\)) and \(\hat {f}^{\mathrm{non}}_{n}\) be the estimator given by (2.3). If (A1)-(A4) hold, then there exists \(C>0\) such that for each \(1\leq p<\infty\) and \(\alpha :=\min\{\frac{s}{2s+3}, \frac{s-\frac{1}{r}+\frac{1}{p}}{2(s-\frac {1}{r})+3}\}\),
Remark 2.6
When \(p=2\) and \(r\geq2\) or \(\{ 1\leq r<2, s>3/r\}\), our result is the exactly same as Theorem 4.2 [1], ignoring the log factor. In this case, \(\alpha=\frac{s}{2s+3}\). In other words, our theorem can be considered as an extension of Theorem 4.2 in [1].
Remark 2.7
When \(\omega(x)=1\) and \(\theta =E(\omega(X))=1\), our result coincides with Theorem 4.2 [6] taken with \(m=0\), ignoring the log factor. In this case, the model reduces to the standard density estimation problem under multiplicative censoring.
Remark 2.8
When \(r< p\), the nonlinear estimator attains a better rate of convergence than that of the linear one, due to \(\frac{s'p}{2s'+3}<\alpha p\) because of \(\frac {s'p}{2s'+3}<\frac{sp}{2s+3}\) and \(\frac{s'p}{2s'+3}<\frac{(s-\frac {1}{r}+\frac{1}{p})p}{2(s-\frac{1}{r})+3}\). When \(r\geq p\), the nonlinear estimator does the same rate of convergence to that of the linear one, i.e., \(n^{-\frac{sp}{2s+3}}\), ignoring the log factor. However, taking into account that the nonlinear estimator is adaptive, it is preferable to the linear one in the estimation of \(f_{X}\).
3 Lemmas
We present several important lemmas in this section, which will be needed for the proofs of our main theorems. To show Lemma 3.1, we need Rosenthal’s inequality [14].
Rosenthal’s inequality
Let \(X_{1}, X_{2}, \ldots, X_{n}\) be independent random variables such that \(E(X_{i})=0\) and \(E |x_{i} |^{p}<\infty\) (\(i=1,2,\ldots, n\)). Then
Lemma 3.1
Let \(\hat{\alpha}_{j,k}\), \(\hat {\beta}_{j,k}\) be defined by (2.2). If (A1)-(A4) hold, then there exists a constant \(C>0\) such that
for \(1\leq p<\infty\) and \(2^{j}\leq n\).
Remark 3.1
When \(p=4\), Lemma 3.1 reduces to Proposition 4.1 in [1].
Proof of Lemma 3.1
One only proves the first inequality, the second one is similar. Clearly,
By (A1) and (A3), \(|\alpha_{j,k}|\leq\int_{0}^{1}f_{X}(x)|\phi _{j,k}(x)|\,dx\lesssim\int_{0}^{1}|\phi_{j,k}(x)|\,dx\lesssim1\). On the other hand, \(c\leq|\theta|= |E (\omega(X) ) |= |\int_{0}^{1}\omega(x)f_{X}(x)\,dx |\leq C\) and \(|\hat{\theta }|\lesssim1\) thanks to (A4). Therefore,
To estimate \(T_{1}\), one defines \(\xi_{i}:=\theta T(\phi _{j,k})(Z_{i})-\alpha_{j,k}\). Then \(T_{1}=E |\frac{1}{n}\sum_{i=1}^{n}\xi_{i} |^{p}\) and \(E(\xi_{i})=0\) by Lemma 4.2 in [1]. By the definition of the operator T,
Moveover, \(E |\theta T(\phi_{j,k})(Z_{i}) |^{p}\lesssim\int_{0}^{1} ( |\phi_{j,k}(x) |^{p}+ |\phi_{j,k}'(x) |^{p} )\,dx\lesssim2^{j(\frac{3}{2}p-1)}\) due to (A3) and (A4). Note that \(|\alpha_{j,k}|^{p}\lesssim1\). Then for \(p\geq1\),
This with Rosental’s inequality leads to
Using the assumption \(2^{j}\leq n\), one obtains \(T_{1}=E |\frac {1}{n}\sum_{i=1}^{n}\xi_{i} |^{p}\lesssim2^{pj}n^{-p/2}\). To end the proof, one needs only to show
Denote
Then \(E|\eta_{i}|^{p}\leq C\) due to (A4). By Theorem 4.1 [1], \(\eta_{1}, \eta_{2}, \ldots, \eta_{n}\) are i.i.d. and \(E(\eta_{i})=0\). Then it follows from Rosental’s inequality that
for \(1\leq p\leq2\) and
for \(p\geq2\), which proves the desired conclusion (3.2). This finishes the proof of Lemma 3.1. □
The well-known Bernstein inequality [14] is needed in order to prove Lemma 3.2.
Bernstein’s inequality
Let \(X_{1}, X_{2}, \ldots, X_{n}\) be i.i.d. random variables with \(E(X_{i})=0\), \(\|X_{i}\|_{\infty}\leq M\). Then, for each \(\gamma>0\),
Lemma 3.2
Let \(\hat{\beta}_{j,k}\) be defined by (2.2). If \(2^{j}\leq n/\ln n\) and (A1)-(A4) hold, then for each \(\varepsilon>0\), there exists \(T>0\) such that
Proof
By the definition of \(\hat{\beta}_{j,k}\),
Then
The proof of Lemma 3.1 shows \(|\hat{\theta}|\leq C\) and \(c\leq|\theta |\leq C\). By (A1) and (A3), \(|\beta_{j,k}|= |\int_{0}^{1}f_{X}(x)\psi_{j,k}(x)\,dx |\lesssim\int_{0}^{1} |\psi _{j,k}(x) |\,dx\lesssim1\). Then (3.4) reduces to
Furthermore,
where
and
By (3.5), one needs only to prove
for the desired conclusion (3.3).
To estimate \(I_{1}\), one defines \(U_{i}=\theta T(\psi_{j, k})(Z_{i})-\beta_{j, k}\). Then
by the definition of the operator T. Using (A4), \(|T(\psi_{j, k})(x)|\lesssim|\psi_{j, k}(x)|+|(\psi_{j, k})'(x)|\lesssim2^{\frac {3}{2}j}\) for \(x\in[0,1]\) and
due to \(|\theta|\lesssim1\) and \(|\beta_{j, k}|\lesssim1\). Moreover, using (3.1) with ψ instead of ϕ and \(p=2\), one obtains \(E |U_{i}|^{2}\lesssim2^{2j}\). Because \(U_{1}, U_{2}, \ldots, U_{n}\) are i.i.d. and \(E(U_{i})=0\) (\(i=1, 2, \ldots, n\)) thanks to Lemma 4.2 [1], Bernstein’s inequality tells us that
with \(\gamma=\frac{T}{4C}2^{j}\sqrt{\frac{\ln n}{n}}\). It is easy to see that \(\frac{n\gamma^{2}}{2[E(U_{i}^{2})+\frac{\gamma}{3}\|U\|_{\infty}]}\geq \frac{n\frac{T^{2}}{16C^{2}}2^{2j}\frac{\ln n}{n}}{2(2^{2j}+\frac {T}{12C}2^{\frac{5}{2}j}\sqrt{\frac{\ln n}{n}})}\geq\frac{T^{2}\ln n}{32C^{2}(1+\frac{T}{12C})}\) because of \(2^{\frac{j}{2}}\sqrt{\frac {\ln n}{n}}\leq1\) by the assumption \(2^{j}\leq\frac{n}{\ln n}\). Note that \(\ln n> j\ln2\) due to \(n\geq2^{j}\ln n>2^{j}\). Hence, \(\frac {n\gamma^{2}}{2[E(U_{i}^{2})+\frac{\gamma}{3}\|U\|_{\infty}]}\geq\frac {T^{2}\ln2}{32C^{2}(1+\frac{T}{12C})}j\). One chooses \(T>0\) such that \(\frac{T^{2}\ln2}{32C^{3}(1+\frac{T}{12C})}>\varepsilon\). Then \(\frac {T^{2}\ln2}{32C^{2}(1+\frac{T}{12C})}>\varepsilon\) due to \(C\geq1\) and \(I_{1}\lesssim \exp(-\frac{T^{2}\ln2}{32C^{2}(1+\frac {T}{12C})}j)\lesssim 2^{-\varepsilon j}\), which shows (3.6) for \(i=1\).
Next, one estimates \(I_{2}\): Define \(W_{i}:=\frac{\omega (Z_{i})-Z_{i}\omega'(Z_{i})}{\omega^{2}(Z_{i})}-\frac{1}{\theta}\). Then \(W_{1}, W_{2}, \ldots, W_{n}\) are i.i.d. and \(E(W_{i})=0\). On the other hand, (A3) implies \(|W_{i}|\leq C\) and \(E|W_{i}|^{2}\leq C\). Applying Bernstein’s inequality, one obtains
with \(\gamma=\frac{T}{4C}2^{j}\sqrt{\frac{\ln n}{n}}\). Note that \(\frac{n\gamma^{2}}{2[E(W_{i}^{2})+\frac{\gamma}{3}\|W\|_{\infty}]}\geq \frac{n\frac{T^{2}}{16C^{2}}2^{2j}\frac{\ln n}{n}}{2(C+\frac {TC}{12C}2^{j}\sqrt{\frac{\ln n}{n}})}\geq\frac{\frac {T^{2}}{16C^{2}}2^{j}\ln n}{2C(1+\frac{T}{12C})}\geq\frac{T^{2}\ln 2}{32C^{3}(1+\frac{T}{12C})}j\) because of \(\sqrt{\frac{\ln n}{n}}\leq1\) and \(\ln n >j\ln2\). The desired conclusion (3.6) (\(i=2\)) follows by taking \(T>0\) such that \(\frac {T^{2}\ln2}{32C^{3}(1+\frac{T}{12C})}>\varepsilon\). This completes the proof of Lemma 3.2. □
4 Proofs
This section is devoted to the proof of Theorems 2.1 and 2.2, based on the knowledge of Section 3. We begin with the proof of Theorem 2.1.
Proof of Theorem 2.1
Clearly, \(\hat {f}^{\mathrm{lin}}_{n}-f^{X}=(\hat {f}^{\mathrm{lin}}_{n}-P_{j_{0}}f_{X})+(P_{j_{0}}f_{X}-f_{X})\) and
It follows from the proof of Theorem 4.1 [15] that
due to \(2^{j_{0}}\sim n^{\frac{1}{2s'+3}}\). By (4.1) and (4.2), it is sufficient to show
for the conclusion of Theorem 2.1. By the definition of \(\hat{f}^{\mathrm{lin}}_{n}\), \(\hat{f}^{\mathrm{lin}}_{n}-P_{j_{0}}f_{X}=\sum_{k\in\wedge}(\hat{\alpha }_{j_{0}, k}-\alpha_{j_{0}, k})\phi_{j_{0}, k}\). Then \(\|\hat {f}^{\mathrm{lin}}_{n}-P_{j_{0}}f_{X}\|_{p}^{p}\lesssim2^{j_{0}(\frac {1}{2}p-1)}\sum_{k\in\wedge}|\hat{\alpha}_{j_{0},k}-\alpha _{j_{0},k}|^{p}\) thanks to Theorem 1.2. This with Lemma 3.1 and the choice of \(j_{0}\) leads to
which is the desired conclusion (4.3). This finishes the proof of Theorem 2.1. □
Next, we prove Theorem 2.2.
Proof of Theorem 2.2
It is sufficient to prove the case \(r\leq p\). In fact, when \(r>p\), \(\hat{f}^{\mathrm{non}}_{n}\) has compact support because of ϕ, ψ, and f having the same property. Then
using the Hölder inequality. For \(f_{X}\in B_{r,q}^{s}(H)\), using Theorem 2.2 for the case \(r=p\), one has
and
Now, one estimates the case \(r\leq p\). By the definition of \(\hat{f}_{n}^{\mathrm{non}}\),
Then
From the proof of Theorem 2.1, one knows
Note that \(2^{j_{0}}\sim n^{\frac{1}{2m+3}}\), \(2^{j_{1}}\sim\frac {n}{\ln n}\), and \(\alpha=\min\{\frac{s}{2s+3},\frac{s-\frac{1}{r}+\frac {1}{p}}{2(s-\frac{1}{r})+3}\}\leq s-\frac{1}{r}+\frac{1}{p}=s'\) thanks to \(s>\frac{1}{r}\). Then
To estimate \(E\|\sum_{j=j_{0}}^{j_{1}}\sum_{k\in \wedge_{j}} (\hat{\beta}_{j,k}1_{\{|\hat{\beta}_{j,k}|>\lambda\} }-\beta_{j,k} )\psi_{j,k}\|_{p}^{p}\), one defines
where
Then \(E\|\sum_{j=j_{0}}^{j_{1}}\sum_{k\in\wedge_{j}} (\hat{\beta}_{j,k}1_{\{|\hat{\beta}_{j,k}|>\lambda\}}-\beta_{j,k} )\psi_{j,k}\|_{p}^{p}\lesssim\sum_{i=1}^{4}E\|T_{i}\|_{p}^{p}\). By (4.4) and (4.5), it is sufficient to show
for the conclusion of Theorem 2.2.
To prove (4.6) for \(i=1\), applying Theorem 1.2, one has
due to the fact that \(\{|\hat{\beta}_{j, k}|> \lambda,|\beta_{j, k}|<\lambda/2 \}\subseteq \{ |\hat{\beta}_{j, k}-\beta_{j, k} |\geq\lambda/2 \}\). By the Hölder inequality,
This with Lemma 3.1 and Lemma 3.2 leads to
thanks to \(j_{1}-j_{0}\sim\ln n\) by the choice of \(j_{0}\) and \(j_{1}\). Take ε such that \(\varepsilon>3 p\). Then \(E\|T_{1}\| _{p}^{p}\lesssim(\ln n)^{p-1}n^{-\frac{p}{2}}2^{\frac{3}{2}pj_{0}} \lesssim(\ln n)^{p-1}n^{-\frac{ps}{2s+3}}\lesssim(\ln n)^{p}n^{-\alpha p}\) due to the choice of \(j_{0}\) and \(\alpha\leq\frac {s}{2s+3}\). That is, (4.6) holds for \(i=1\).
To show (4.6) for \(i=3\), one uses the fact that \(\{|\hat{\beta}_{j, k}|\leq\lambda, |\beta_{j, k}|>2\lambda \} \subseteq \{|\hat{\beta}_{j, k}-\beta_{j, k}|\geq\lambda/2 \}\). Hence,
thanks to Theorem 1.2. When \(|\hat{\beta}_{j, k}|\leq\lambda< |\beta _{j, k}|/2\), \(|\hat{\beta}_{j, k}-\beta_{j, k}|\geq|\beta_{j, k}|-|\hat {\beta}_{j, k}|>|\beta_{j, k}|/2>\lambda\). Then
due to Theorem 1.2. The same arguments as above shows \(E\|T_{3}\| _{p}^{p}\lesssim(\ln n)^{p}n^{-\alpha p}\), which is the desired conclusion (4.6) with \(i=3\).
In order to estimate \(E\|T_{2}\|_{p}^{p}\) and \(E\|T_{4}\| _{p}^{p}\), one defines
Recall that \(2^{j_{0}}\sim n^{\frac{1}{2m+3}}\), \(2^{j_{1}}\sim\frac {n}{\ln n}\), and \(\alpha:=\min\{ \frac{s}{2s+3}, \frac{s-\frac{1}{r}+\frac {1}{p}}{2(s-\frac{1}{r})+3}\}\). Then \(\frac{1-2\alpha}{3}\geq\frac{1}{2s+3}>\frac{1}{2m+3}\) and \(\frac {\alpha}{s-\frac{1}{r}+\frac{1}{p}}\leq\frac{1}{2(s-\frac {1}{r})+3}\leq1\). Hence, \(2^{j_{0}}\leq2^{j_{0}^{\ast}}\) and \(2^{j_{1}^{\ast}}\leq 2^{j_{1}}\). Moreover, a simple computation shows \(\frac{1-2\alpha }{3}\leq\frac{\alpha}{s-\frac{1}{r}+\frac{1}{p}}\), which implies \(2^{j_{0}^{\ast}}\leq2^{j_{1}^{\ast}}\).
One estimates \(E\|T_{2}\|_{p}^{p}\) by dividing \(T_{2}\) into
Then \(E\|t_{1}\|_{p}^{p}\lesssim(j_{1}-j_{0}+1)^{p-1}\sum_{j=j_{0}}^{j_{0}^{\ast}}2^{j(\frac{1}{2}p-1)}\sum_{k\in\wedge _{j}}E |\hat{\beta}_{j, k}-\beta_{j, k}|^{p}\) due to Theorem 1.2. This with Lemma 3.1 and the definition of \(j_{0}^{\ast}\) leads to
To estimate \(E\|t_{2}\|_{p}^{p}\), one observes that \(1_{\{|\hat{\beta}_{j, k}|>\lambda, |\beta_{j, k}|\geq\lambda/2\}}\leq 1_{\{|\beta_{j, k}|\geq\lambda/2\}}\leq(\frac{|\beta_{j, k}|}{\lambda /2})^{r}\). Then it follows from Theorem 1.2 that
By Lemma 3.1, \(E |\hat{\beta}_{j, k}-\beta_{j, k}|^{p}\leq n^{-\frac {p}{2}}2^{pj}\). On the other hand, \(\|\beta_{j, \cdot}\|_{r}\leq 2^{-j(s+\frac{1}{2}-\frac{1}{r})}\) for \(f\in B^{s}_{r,q}(H)\) due to Theorem 1.1. Then (4.9) reduces to
Substituting \(\lambda=T2^{j}\sqrt{\frac{\ln n}{n}}\) into the above inequality, one gets
due to \(j_{1}-j_{0}\sim\ln n\). Denote \(\theta:=sr+\frac{3}{2}r-\frac {3}{2}p\). When \(\theta>0\), \(r>\frac{3p}{2s+3}\), and
thanks to the definition of \(2^{j_{0}^{\ast}}\).
Moreover, (4.10) also holds for \(\theta\leq0\). In fact, the same analysis as (4.9) produces
where \(r_{1}:=(1-2\alpha)p\). When \(\theta\leq0\), \(\alpha=\frac{s-\frac {1}{r}+\frac{1}{p}}{2(s-\frac{1}{r})+3}\leq\frac{s}{2s+3}\), and \(r\leq \frac{3p}{2s+3}\leq(1-2\alpha)p=r_{1}\). Then \(\|\beta_{j, \cdot}\| _{r_{1}}\leq\|\beta_{j, \cdot}\|_{r}\leq2^{-j(s-\frac{1}{r}+\frac {1}{2})}\) for \(f\in B^{s}_{r, q}(H)\) thanks to Theorem 1.1. Using Lemma 3.1 and the definition of λ, one has
Note that \(\frac{3}{2}p-1-(s-\frac{1}{r}+\frac{3}{2})r_{1}=0\) because of \(r_{1}=(1-2\alpha)p\) and \(\alpha=\frac{s-\frac{1}{r}+\frac {1}{p}}{2(s-\frac{1}{r})+3}\). Hence, \(E\|t_{2}\|_{p}^{p}\lesssim(\ln n)^{p-\frac{1}{2}r_{1}}n^{\frac {1}{2}(r_{1}-p)}\lesssim(\ln n)^{\frac{1}{2}p}(\frac{\ln n}{n})^{\alpha p}\), which shows (4.10). Combing (4.7), (4.8), and (4.10), one obtains the desired conclusion (4.6) with \(i=2\).
To end the proof, it is sufficient to estimate \(E\|T_{4}\| _{p}^{p}\). When \(\theta> 0\), one splits \(T_{4}\) into
Since \(|\beta_{j, k}|1_{\{|\hat{\beta}_{j, k}|\leq2\lambda, |\beta_{j, k}|\leq2 \lambda\}}\leq2|\lambda|\), \(E\|e_{1}\|_{p}^{p}\lesssim(j_{1}-j_{0}+1)^{p-1}\sum_{j=j_{0}}^{j_{0}^{\ast}}2^{j(\frac{1}{2}p-1)}2^{j}|\lambda|^{p}\) due to Theorem 1.2. Note that \(\lambda=T 2^{j}\sqrt{\frac{\ln n}{n}}\), \(2^{j_{0}^{\ast}}\sim(\frac{n}{\ln n})^{\frac{1-2\alpha}{3}}\), and \(\alpha=\frac{s}{2s+3}\) when \(\theta>0\). Then
To estimate \(E\|e_{2}\|_{p}^{p}\) with \(e_{2}=\sum_{j=j_{0}^{\ast}+1}^{j_{1}}\sum_{k\in\wedge_{j}}\beta_{j, k}\psi _{j, k}(x)1_{\{|\hat{\beta}_{j, k}|\leq\lambda, |\beta_{j, k}|\leq2 \lambda\}}\), one uses the fact that \(1_{\{|\hat{\beta}_{j, k}|\leq \lambda, |\beta_{j, k}|\leq2 \lambda\}}\leq(\frac{2\lambda}{|\beta _{j, k}|})^{p-r}\) because of \(r\leq p\). Then
due to Theorem 1.2. By \(f\in B^{s}_{r, q}(H)\) and Theorem 1.1, \(\|\beta_{j, \cdot}\| _{r}\leq2^{-j(s-\frac{1}{r}+\frac{1}{2})}\). Furthermore,
In the last inequality, one used the assumption \(2^{j_{0}^{\ast}}\sim (\frac{n}{\ln n})^{\frac{1-2\alpha}{3}}\) and \(\alpha=\frac{s}{2s+3}\) for \(\theta> 0\). This with (4.11) and (4.12) leads to
Then one needs only to show that (4.15) holds for \(\theta\leq0\). Similarly, one divides \(T_{4}\) into
For the first sum, proceeding as (4.13) and (4.14), one has \(E\|e_{1}^{\ast}\|_{p}^{p}\lesssim(j_{1}-j_{0}+1)^{p-1} (\frac{\ln n}{n})^{\frac{p-r}{2}}\sum_{j=j_{0}}^{j_{1}^{\ast}}2^{-j(sr+\frac {3}{2}r-\frac{3}{2}p)} \lesssim(j_{1}-j_{0}+1)^{p-1}(\frac{\ln n}{n})^{\frac{p-r}{2}}2^{-j_{1}^{\ast}(sr+\frac{3}{2}r-\frac{3}{2}p)}\). Note that \(j_{1}-j_{0}\sim\ln n\) and \(2^{j_{1}^{\ast}}\sim(\frac {n}{\ln n})^{\frac{\alpha}{s-\frac{1}{r}+\frac{1}{p}}}\). Then \(E\| e_{1}^{\ast}\|_{p}^{p}\lesssim(\ln n)^{p-1}(\frac{\ln n}{n})^{\alpha p}\) due to \(\alpha=\frac{s-\frac{1}{r}+\frac{1}{p}}{2(s-\frac{1}{r})+3}\) for \(\theta\leq0\).
To estimate the second sum, using Theorem 1.2,
By \(f\in B^{s}_{r, q}(H)\) and Theorem 1.1, \(\|\beta_{j, \cdot}\| _{p}\leq\|\beta_{j, \cdot}\|_{r}\lesssim2^{-j(s-\frac{1}{r}+\frac {1}{2})}\). Hence, \(E\|e_{2}^{\ast}\|_{p}^{p}\lesssim (j_{1}-j_{0}+1)^{p-1}\sum_{j=j_{1}^{\ast}+1}^{j_{1}}2^{-j(s-\frac {1}{r}+\frac{1}{p})p}\lesssim(j_{1}-j_{0}+1)^{p-1}2^{-j_{1}^{\ast }(s-\frac{1}{r}+\frac{1}{p})p}\). By the choice of \(j_{1}^{\ast}\), \(E\|e_{2}^{\ast}\|_{p}^{p}\lesssim(\ln n)^{p-1}(\frac{\ln n}{n})^{\alpha p}\) because of \(\alpha=\frac{s-\frac{1}{r}+\frac {1}{p}}{2(s-\frac{1}{r})+3}\), when \(\theta\leq0\). Then the desired conclusion (4.15) follows. This completes the proof of Theorem 2.2. □
References
Abbaszadeh, M, Chesneau, C, Doosti, H: Nonparametric estimation of a density under bias and multiplicative censoring via wavelet methods. Stat. Probab. Lett. 82, 932-941 (2012)
Vardi, Y: Multiplicative censoring, renewal processes, deconvolution and decreasing density: nonparametric estimation. Biometrika 76, 751-761 (1989)
Vardi, Y, Zhang, CH: Large sample study of empirical distributions in a random multiplicative censoring model. Ann. Stat. 20, 1022-1039 (1992)
Andersen, K, Hansen, M: Multiplicative censoring: density estimation by a series expansion approach. J. Stat. Plan. Inference 98, 137-155 (2001)
Asgharian, M, Carone, M, Fakoor, V: Large-sample study of the kernel density estimators under multiplicative censoring. Ann. Stat. 40, 159-187 (2012)
Abbaszadeh, M, Chesneau, C, Doosti, H: Multiplicative censoring: estimation of a density and its derivatives under the \(L_{p}\)-risk. REVSTAT Stat. J. 11(3), 255-276 (2013)
Efromovich, S: Density estimation for biased data. J. Stat. Plan. Inference 32(3), 1137-1161 (2004)
Chesneau, C: Wavelet block thresholding for density estimation in the presence of bias. J. Korean Stat. Soc. 39, 43-53 (2010)
Ramirez, P, Vidakovic, B: Wavelet density estimation for stratified size-biased sample. J. Stat. Plan. Inference 140, 419-432 (2010)
Chesneau, C: Regression with random design: a minimax study. Stat. Probab. Lett. 77, 40-53 (2007)
Donoho, DL, Johnstone, IM, Kerkyacharian, G, Picard, D: Density estimation by wavelet thresholding. Ann. Stat. 24, 508-539 (1996)
Li, LY: On the minimax optimality of wavelet estimators with censored data. J. Stat. Plan. Inference 137, 1138-1150 (2007)
Daubechies, I: Ten Lectures on Wavelets. SIAM, Philadelphia (1992)
Härdle, W, Kerkyacharian, G, Picard, D, Tsybakov, A: Wavelets, Approximations, and Statistical Applications. Lecture Notes in Statistics. Springer, Berlin (1998)
Liu, YM, Xu, JL: Wavelet density estimation for negatively associated stratified size-biased sample. J. Nonparametr. Stat. 26(3), 537-554 (2014)
Acknowledgements
This work is supported by the National Natural Science Foundation of China, Tian Yuan Foundation (11426040), the National Natural Science Foundation of China (11301558), Beijing Municipal Education Commission Project (KM201511417002) and Beijing Talents Found (2014000020124G065).
Author information
Authors and Affiliations
Corresponding author
Additional information
Competing interests
The authors declare that they have no competing interests.
Authors’ contributions
LH, LL, JX, and HW completed this work together. All four authors read and approved the final manuscript.
Rights and permissions
Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.
About this article
Cite this article
Hu, L., Xu, J., Li, L. et al. \(L^{p}\) convergence rate of wavelet estimators for the bias and multiplicative censoring model. J Inequal Appl 2015, 208 (2015). https://doi.org/10.1186/s13660-015-0732-2
Received:
Accepted:
Published:
DOI: https://doi.org/10.1186/s13660-015-0732-2