Abstract
This paper deals with the nonparametric estimation of the expectile regression when the observations are spatially correlated and are of a functional nature. The main findings of this work is the establishment of the almost complete convergence for the proposed estimator under some general mixing conditions. The performance of the proposed estimator is examined by using simulated data. Finally, the studied model is used to evaluate the air quality indicators in northeast China.
Similar content being viewed by others
References
Aneiros G, Cao R, Fraiman R, Genest C, Vieu P (2019) Recent advances in functional data analysis and high-dimensional statistics. J Multivar Anal 170:3–9
Bellini F, Negri I, Pyatkova M (2019) Backtesting VaR and expectiles with realized scores. Stat Methods Appl 28:119–142
Bersimis S, Degiannakis S, Georgakellos D (2017) Real time monitoring of carbon monoxide using value-at-risk measure and control charting. J Appl Stat 1:89–118
Biau G, Cadre B (2004) Nonparametric spatial prediction. Stat Inference Stoch Process 7:327–349
Bogachev VI (1999) Gaussian measures. Math surveys and monographs, vol 62. American Mathematical Society
Bosq D (2000) Linear processes in function spaces: theory and applications. Lecture notes in statistics, vol 149. Springer, New-York
Campbell S (2005) A review of backtesting and backtesting procedures. Finance and economics discussion series, Federal Reserve Board, Washington
Carbon M, Hallin M, Tran LT (1996) Kernel density estimation for random fields: the \(L^1\)-theory. J Nonparametric Stat 6:157–170
Chen JM (2018) On exactitude in financial regulation: value-at-risk, expected shortfall, and expectiles. Risks 6:1–28
Dabo-Niang S, Yao AF (2007) Kernel regression estimation for continuous spatial processes. Math Methods Stat 16(4):298–317
Daouia A, Girard S, Stupfler G (2018) Estimation of tail risk based on extreme expectiles. J R Stat Soc Ser B Stat Methodol 80(2):263–292
Delicado P, Giraldo R, Comas C, Mateu J (2010) Statistics for spatial functional data: some recent contributions. Environmetrics 21:224–239
Doukhan P (1994) Mixing: properties and examples. Lecture notes in statistics, vol 85. Springer, New York
Efron B (1991) Regression percentiles using asymmetric squared error loss. Stat Sin 1(1):93–125
Ferraty F, Vieu P (2006) Nonparametric functional data analysis. Theory and practice, Springer, New York
Guo M, Härdle W (2012) Simultaneous confidence bands for expectile functions. AStA Adv Stat Anal 96(4):517–541
Guyon X (1987) Estimation d’un champ par pseudo-vraisemblance conditionnelle: Etude asymptotique et application au cas Markovien. In: Proceedings of the sixth Franco-Belgian meeting of statisticians
Hallin M, Lu Z, Yu K (2009) Local linear spatial quantile regression. Bernoulli 15(3):659–686
Jenish NI, Prucha R (2009) Central limit theorems and uniform laws of large numbers for arrays of random fields. J Econom 150:86–98
Laksaci A, Lemdani M, Ould-Saïd E (2009) A generalized \(L^1\)-approach for a kernel estimator of conditional quantile with functional regressors: consistency and asymptotic normality. Stat Probab Lett 79:1065–1073
Laksaci A, Rachdi M, Rahmani S (2013) Spatial modelization: local linear estimation of the conditional distribution for functional data. Spat Stat 6:1–23
Li J, Tran LT (2009) Nonparametric estimation of conditional expectation. J Stat Plan Inference 139:164–175
Ling N, Vieu P (2018) Nonparametric modelling for functional data: selected survey and tracks for future. Statistics 52:934–949
Masry E (2005) Nonparametric regression estimation for dependent functional data: asymptotic normality. Stoch Process Appl 115:155–177
Maume-Deschamps V, Rullière D, Said K (2018) Asymptotic multivariate expectiles. arXiv:1704.07152, revised Jan 2018
Moore M (1988) Spatial linear processes. Commun Stat Stoch Models 4(1):45–75
Nerini D, Monestiez P, Manté C (2010) Cokriging for spatial functional data. J Multivar Anal 101:409–418
Quintela-del-Río A, Francisco-Fernández M (2011) Nonparametric functional data estimation applied to ozone data: prediction and extreme value analysis. Chemosphere 82(6):800–808
Rachdi M, Vieu P (2007) Nonparametric regression for functional data: automatic smoothing parameter selection. J Stat Plan Inference 137(9):2784–2801
Ramsay JO, Silverman BW (2005) Functional data analysis, 2nd edn. Springer, New York
Robinson PM (2011) Asymptotic theory for nonparametric regression with spatial data. J Econom 165:5–19
Shang HL (2014) Bayesian bandwidth estimation for a functional nonparametric regression model with mixed types of regressors and unknown error density. J Nonparametric Stat 26:599–615
Tran LT (1990) Kernel density estimation on random fields. J Multivar Anal 34:37–53
Tjostheim D (1978) Statistical spatial series modelling. Adv Appl Probab 10:130–154
Zhao J, Chen Y, Zhang Y (2018) Expectile regression for analyzing heteroscedasticity in high dimension. Stat Probab Lett 137:304–311
Acknowledgements
The authors are indebted to the Editor-in-Chief, the Associate Editor and the referees for their very valuable comments and suggestions which led to a considerable improvement of the manuscript. The authors also thank and extend their appreciation to the Deanship of Scientific Research at King Khalid University for funding this work through the Research Groups Program under Grant Number R.G.P.1/64/42
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Appendix
Appendix
In order to keep the standard techniques in nonparametric spatial statistics, we will assume for our asymptotic results, without loss of generality, that the set of sites of observations \(I_n\) is a rectangular region. Next, we state the following two lemmas that were used by Carbon et al. (1996). These lemmas will be useful to prove the strong convergence of our estimators. Their proofs are obviously omitted.
Lemma 1
Let \(E_1,\ldots , E_r\) be r sets containing m sites each with \(dist(E_i,\ E_j)\ge \gamma \) for all \(i\ne j\) where \(1\le i, j\le r\). Assume that \(Z_1, \ldots , Z_r\) is a sequence of real-valued random variables which are measurable with respect to \(\mathscr {B}(E_1),\ldots ,\mathscr {B}(E_r)\) respectively, and that \(Z_i\) takes values in [a, b]. Then, there exists a sequence of independent random variables \(Z_1^*,\ldots ,Z_r^*\) which is independent of \(Z_1, \ldots ,Z_r\) such that \(Z_i^*\) has the same distribution as \(Z_i\) and satisfies
Lemma 2
(i) Suppose that (1) holds. Denote by \(\mathcal {L}_r(\mathcal {F})\) the class of \(\mathcal {F}-\)measurable r.v.’s X satisfying \(\Vert X\Vert _r=({\mathbb {E}}|X|^r)^{1/r}<\infty \). Suppose \(X \in \mathcal {L}_r(\mathscr {B}(E))\) and \(Y \in \mathcal {L}_r(\mathscr {B}(E'))\). Assume also that \(1\le r,\ s,\ t<\infty \) and \(r^{-1}+s^{-1}+t^{-1}=1\). Then
(ii) For r.v.’s bounded with probability 1, the right-hand side of (7) can be replaced by
Proof of Theorem 1 For \(\epsilon >0\) small enough, we use the fact that the function \(G(x,\cdot )\) is an increasing function and has a strictly positive derivative in the neighbourhood of \(\xi _p(x)\), to write that
So, it suffices to apply the result of the Proposition 1 to conclude the result of Theorem 1.
Proof of Proposition 1 We decompose \( \widehat{G}(x,t)-G(x,t)\) as follows
Thus, Proposition 1 as well as Theorem 1’s results are direct consequences of the following intermediate results.
Lemma 3
Under the assumptions of Lemma 4 and, in addition, we assume that the assumptions (A2) and (A4) hold, then we have, as \({n} \rightarrow \infty \),
and
where \(O_{a.co.}(\cdot )\) denotes O almost completely.
Lemma 4
Assume that assumptions (A1), (A3) and (A5) are fulfilled. We have, as \({n} \rightarrow \infty \),
and
Lemma 5
Under the assumptions of Lemma 4 and, in addition, we assume that the assumptions (A2) and (A4) are satisfied, we have, as \({n} \rightarrow \infty \),
Then, in order to show the consistency of \(\widehat{\xi _p}(x)\), it suffices to show that the estimator \( \widehat{G}_{2}(x,t)\) converges almost completely, as \({n} \rightarrow \infty \).
Proof of Lemma 3. Because of the similarity between the first and the second cases we prove only the first one. To do that, we combine the arguments of Laksaci et al. (2009) in FDA to those of Tran (1990) in the spatial model (finite dimension). Indeed, the compactness of \( [\xi _p(x)-\epsilon _0, \, \xi _p(x)+\epsilon _0]\) allows to recover it by \(d_{n}=O\left( \widehat{n}^{1/2 \varsigma _1}\right) \) intervals whose the subset of their extremities is
with \(l_{n}=O(d_{n}^{-1})\). Next, using the monotony of \( \widehat{G}_{1}(x,\cdot )\) and \({\mathbb {E}}[ \widehat{G}_{1}(x,\cdot )]\) to conclude that
Evidently, we have
So, all it remains to prove is that
which requires to show that
So, for all \(z=t_j\mp l_{n}\) with \(1\le j\le d_{n}\), we must evaluate the quantity
For this aim, we use the truncation method, where we define
such that \(Y_{\mathbf{i}}^*= (Y_\mathbf{i}-t)\mathbb {I}_{\big \{Y_\mathbf{i}-t\le 0, \, |Y_\mathbf{i}|< \gamma _{{n}}\big \}}\) and \(K_\mathbf{i}(x)=K\Big (h^{-1}_{n}d(x,X_\mathbf{i})\Big ) \).
So, the claimed quantity (14) is a consequence of the following three quantities:
and
-
Concerning the quantity (15). The Markov’s inequality permits to write that,
$$\begin{aligned} {\mathrm {I\!P}}\Big (\Big |\widehat{G^{*}_1}(x,t)- \widehat{G}_1(x,t)\Big |> \epsilon _0\left( \sqrt{\frac{\log \widehat{n}}{\widehat{n}\,\phi _x(h_{n})}}\right) \Big )\le & {} \sum _{\mathbf{i}\in I_n} {\mathrm {I\!P}}\Big (Y_{\mathbf{i}}>\gamma _{{n}}\Big ) \\\le & {} \widehat{n}\gamma _{{n}}^{-q}{\mathbb {E}}\Big [Y^q\Big ]. \end{aligned}$$It follows that
$$\begin{aligned} {\mathrm {I\!P}}\left( \left| \widehat{G^{*}_1}(x,t)-\widehat{G}_1(x,t)\right| >{\eta }\sqrt{\frac{\log \widehat{n}}{\widehat{n}\phi _{x}(h_{n})}}\right) \le C \widehat{n}\gamma _{{n}}^{-q}. \end{aligned}$$(18) -
For the quantity (16). The main tool of this part is the Hölder’s inequality. Indeed, put \(\displaystyle \alpha =\frac{q}{2}\) then for \( \beta \) such that
$$ \frac{1}{\alpha }+\frac{1}{\beta }=1$$we have
$$\begin{aligned}&\Big |{\mathbb {E}}[\widehat{G^{*}_1}(x,t)]- {\mathbb {E}}[\widehat{G}_1(x,t)]\Big |\\&\quad \le \frac{C}{ \mathbb {E}\Big [K_\mathbf{1}(x)\Big ]}{\mathbb {E}}\Big [\big |Y^-\big |\mathbb {I}_{\big \{Y\ge \gamma _{{n}}\big \}}K\Big (h^{-1}_{n}d(x,X)\Big )\Big ]\\&\quad \le \frac{\gamma _{{n}}^{-1}}{\mathbb {E}\Big [K_\mathbf{1}(x)\Big ]} {\mathbb {E}}^{1/\alpha }\Big [\big |Y^{2\alpha }\big |\Big ] {\mathbb {E}}^{1/\beta }\Big [K^{\beta }\Big (h^{-1}_{n}d(x,X_1)\Big )\Big ]\\&\quad \le \frac{C\gamma _{{n}}^{-1}}{\mathbb {E}\Big [K_\mathbf{1}(x)\Big ]} {\mathbb {E}}^{1/\beta }\Big [K_\mathbf{1}^{\beta }(x)\Big ].\end{aligned}$$We deduce that
$$\begin{aligned} \Big |{\mathbb {E}}[\widehat{G^{*}_1}(x,t)]- {\mathbb {E}}[\widehat{G}_1(x,t)]\Big |\le C\gamma _{{n}}^{-1}\phi _x^{(1-\beta )/\beta }(h_{n}). \end{aligned}$$(19) -
About the quantity (17). We use the spatial blocks decomposition which was used by Tran (1990). Indeed, we write
$$\begin{aligned} \widehat{G^{*}_1}(x,t) -{\mathbb {E}}[\widehat{G^{*}_1}(x,t)]= {1\over \widehat{n}\mathbb {E}\Big [K_\mathbf{1}(x)\Big ]}\sum _{\mathbf{i}\in I_n} K_{\mathbf{i}}(x) Y^*_{\mathbf{i}}-{\mathbb {E}}\Big [K_\mathbf{1}(x)Y^*_{\mathbf{i}}\Big ], \end{aligned}$$where \( \displaystyle \Lambda _{\mathbf{i}}=K_{\mathbf{i}}(x) Y^*_{\mathbf{i}}-{\mathbb {E}}\Big [K_\mathbf{1}(x)Y^*_{\mathbf{i}}\Big ]. \) The spatial blocks decomposition of this sum are defined, for an arbitrary integer sequence \((p_{n})\) by taking
$$\begin{aligned} U(1,{n},\mathbf{j})= & {} \sum _{{\mathop {k=1,\ldots ,N}\limits ^{i_k=2j_kp_{n}+1}}}^{2j_kp_{n}+p_{n}}\Lambda _{\mathbf {i}},\\ U(2,{n},\mathbf{j})= & {} \displaystyle \sum _{{\mathop {k=1,\ldots ,N-1}\limits ^{i_k=2j_kp_{n}+1}}}^{2j_kp_{n}+p_{n}} \quad \displaystyle \sum _{i_N=2j_Np_{n}+p_{n}+1}^{(j_N+1)p_{n}}\Lambda _{\mathbf {i}},\\ U(3,{n},\mathbf{j})= & {} \displaystyle \sum _{{\mathop {k=1,\ldots ,N-2}\limits ^{i_k=2j_kp_{n}+1}}}^{2j_kp_{n}+p_{n}} \quad \displaystyle \sum _{i_{N-1}=2j_{N-1}p_{n}+p_{n}+1}^{2(j_{N-1}+1)p_{n}} \quad \displaystyle \sum _{i_N=2j_Np_{n}+1}^{2j_Np_{n}+p_{n}}\Lambda _{\mathbf {i}},\\ U(4,{n},\mathbf{j})= & {} \displaystyle \sum _{{\mathop {k=1,\ldots ,N-2}\limits ^{i_k=2j_kp_{n}+1}}}^{2j_kp_{n}} \quad \displaystyle \sum _{i_{N-1}=2j_{N-1}p_{n}+p_{n}+1}^{2(j_{N-1}+1)p_{n}}\quad \displaystyle \sum _{i_N=2j_Np_{n}+p_{n}+1}^{2(j_N+1)p_{n}}\Lambda _{\mathbf {i}}, \end{aligned}$$$$\vdots $$The last term is
$$\begin{aligned} U(2^N,{n},\mathbf{j})= & {} \sum _{{\mathop {k=1,\ldots ,N}\limits ^{i_k=2j_kp_{n}+p_{n}+1}}}^{2(j_k+1)p_{n}} \Lambda _{\mathbf {i}}. \end{aligned}$$Finally, we get
$$\begin{aligned} \widehat{G^{*}_1}(x,t) -{\mathbb {E}}[\widehat{G^{*}_1}(x,t)]=\frac{1}{\widehat{n}\mathbb {E}\Big [K_\mathbf{1}(x)\Big ]}\displaystyle \sum _{i=1}^{2^N}T(n,i), \end{aligned}$$(20)with \(\displaystyle T(n,i)=\sum _{\mathbf {j}\in \mathcal {J}}U(i,n,\mathbf {j})\); \(\displaystyle \mathcal {J}=\big \{0,\ldots ,r_1-1\big \}\times \cdots \times \big \{0,\ldots ,r_N-1\big \}\) and \(\displaystyle r_{{ i}} =2 n_{{ i}}\big /p_{n}\) for \(i = 1, \ldots ,N\).
Furthermore, from (20) we get, for all \(\eta >0\)
Now, the claimed result is based on the evaluation of the quantities
Once again, we only treated the case \(i=1\). The other cases are similar. Indeed, the main step of the proof is the application of Lemma 1 on the random variables \(U(1,n,\mathbf {j})\) for \(\mathbf {j}\in \mathcal {J}\). Precisely, we enumerate these variables in the arbitrary way \(Z_1,\ldots , Z_M\) with \(M=\prod _{k=1}^N r_k=2^{-N}\widehat{n}p_{n}^{-N}\le \widehat{n}p_{n}^{-N}\) where for each \(Z_{{j}}\), there exists certain \(\mathbf {j}\) in \(\mathcal {J}\) such that
where \(\displaystyle I(1,n,\mathbf {j})=\Big \{\mathbf {i}:2j_{k}p_{n}+1\le i_k\le 2j_{k}p_{n}+p_{n} \hbox { for } k=1,\ldots ,N\Big \}.\)
Clearly the sets \(\displaystyle I(1,n,\mathbf {j})\) contain \(p_{n}^N \) sites and are far apart by distant of \(p_{n} \) at least. Moreover, under Assumption (A4), we have
Therefore, the Lemma 1 gives M independent random variables \(Z_1^*,\ldots ,Z_M^*\) which are identically distributed as \(Z_j\) for \(j=1,\ldots , M\) and such that
Therefore, we have
where
and
Firstly, the asymptotic evaluation of the term \(B_2(n)\) is based on the Markov inequality and (21), which allow to write that
Next, by using the facts that
and
we get for \(\eta =\displaystyle \eta _0\sqrt{\frac{\log \widehat{n}}{\widehat{n}\, \phi _x(h_{n})}}\), that
So, for the particular choice of \(p_{n}= C\,\Bigg (\displaystyle \frac{\widehat{n}\phi _x(h_{n})}{\log \widehat{n}\gamma _{{n}}^2}\Bigg )^{1/2N}\), we get
Secondly, the asymptotic evaluation of the \(B_1(n)\) is based on the Bernstein inequality. This last allows to write that
Now, the rest of the proof is based on the asymptotic evaluation of the variance term
To do that we evaluate separately the variance and the covariance terms under Assumption (A2). Indeed, on one hand, under the first part of Assumption (A2), we have
It follows that
Concerning the covariance term, we use the second part of Assumption (A2), for all \(\mathbf {i}\not =\mathbf {j}\)
Furthermore, since \({\mathbb {E}}\Big [Y^{p}_{\mathbf{i}}|X_{\mathbf{i}}\Big ]<\infty \) then, for all \(\mathbf {i}\not =\mathbf {j}\)
Now, we use the same technique as in Masry (2005). The letter is based on the following decomposition
where \((c_{{n}})\) is an arbitrary sequence of real numbers which tends to \(+\infty \). In particular, for \(c_{{n}}= \phi _x(h_{n})^{2/Np(a+1)-1/Na}\) we have
We deduce that
Finally, we replace \( var\Big [Z_1^{*}\Big ]=O\Big (p_{n}^N\phi _x(h_{n})\Big )\) in (23) to show that
We combine equations (14), (19), (22) and (24) together with Assumption (H5) to conclude that
which complete the proof of this lemma.
Proof of Lemma 4 Again, we only prove the first result and the second one can be obtained by the same manner. the proof is based on the stationarity of the process, which allows to write that
Now, by Assumption (A4), we get
Furthermore, Assumption (A3) allows to get
which easily implies that
Proof of Lemma 5. Observe that
By applying Lemma 3’s result for \(\displaystyle \epsilon _0=\frac{1}{2}\inf _{t \in [\xi _p(x)-\epsilon _0, \, \xi _p(x)+\epsilon _0]}G_2(x,t)>0,\) we obtain
hence this lemma’s proof is complete.
Rights and permissions
About this article
Cite this article
Rachdi, M., Laksaci, A. & Al-Kandari, N.M. Expectile regression for spatial functional data analysis (sFDA). Metrika 85, 627–655 (2022). https://doi.org/10.1007/s00184-021-00846-x
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00184-021-00846-x