Abstract
Rank regression method has been widely pursued for robust inference in statistical models. Unfortunately, there does not exist related literature for the function-on-scalar model, which is the focus of this paper. We study the robust estimation based on rank regression and B-spline approximations for the function-on-scalar model and further establish the theoretical properties of the proposed method under regularity conditions. Extensive simulation studies and two real data applications are given to illustrate the merits of the proposed approach. Numerical results show that the proposed method is competitive with existing robust estimation procedures.
Similar content being viewed by others
References
Beyaztas U, Shang HL, Alin A (2022) Function-on-function partial quantile regression. J Agric Biol Environ Stat 27:149–174
Cai X, Xue L, Cao J (2021) Robust penalized M-estimation for function-on-function linear regression. Stat 10:e390
Cao G, Wang S, Wang L (2020) Estimation and inference for functional linear regression models with partially varying regression coefficients. Stat 9:e286
Cardot H, Crambes C, Sarda P (2005) Quantile regression when the covariates are functions. J Nonparametr Stat 17:841–856
Chen Y, Goldsmith J, Ogden R (2016) Variable selection in function-on-scalar regression. Stat 5:88–101
Denhere M, Bindele HF (2016) Rank estimation for the functional linear model. J Appl Stat 43:1928–1944
Feng L, Zou C, Wang Z (2012) Rank-based inference for the single-index model. Stat Probab Lett 82:535–541
Feng L, Zou C, Wang Z, Wei X, Chen B (2015) Robust spline-based variable selection in varying coefficient model. Metrika 78:85–118
Ferraty F, Vieu P (2006) Nonparametric functional data analysis: theory and practice. Springer, New York
Goldsmith J, Kitago T (2016) Assessing systematic effects of stroke on motorcontrol by using hierarchical function-on-scalar regression. J R Stat Soc Ser C Appl Stat 65:215–236
Guo C, Yang H, Lv J, Wu J (2016) Joint estimation for single index mean-covariance models with longitudinal data. J Korean Stat Soc 45:526–543
Guo C, Yang H, Lv J (2017) Robust variable selection in high-dimensional varying coefficient models based on weighted composite quantile regression. Stat Pap 58:1009–1033
He X, Fung W, Zhu Z (2005) Robust estimation in generalized partial linear models for clustered data. J Am Stat Assoc 100:1176–1184
Hettmansperger TP, McKean JW (2011) Robust nonparametric statistical methods, 2nd edn. Chapman and Hall, Boca Raton
Jaeckel LA (1972) Estimating regression coefficients by minimizing the dispersion of residuals. Ann Math Stat 43:1449–1458
Johnson BA, Peng L (2008) Rank-based variable selection. J Nonparametr Stat 20:241–252
Jureckova J (1971) Nonparametric estimate of regression coefficients. Ann Math Stat 42:1328–1338
Kato K (2012) Estimation in functional linear quantile regression. Ann Stat 40:3108–3136
Kim M (2007) Quantile regression with varying coefficients. Ann Stat 35:92–108
Knight K (1998) Limiting distributions for \(L_1\) regression estimators under general conditions. Ann Stat 26:755–770
Koenker R, Bassett G (1978) Regression quantiles. Econometrica 46:33–50
Leng C (2010) Variable selection and coefficient estimation via regularized rank regression. Ann Stat 20:167–181
Li J, Lian H, Jiang X, Song X (2018) Estimation and testing for time-varying quantile single-index models with longitudinal data. Comput Stat Data Anal 118:66–83
Lv J, Yang H, Guo C (2016) Robust estimation for varying index coefficient models. Comput Stat 31:1131–1167
Martínez-Hernández I (2019) Robust depth-based estimation of the functional autoregressive model. Comput Stat Data Anal 131:66–79
Noh H, Chung K, Keilegom I (2012) Variable selection of varying coefficient models in quantile regression. Electron J Stat 6:1220–1238
Ramsay JO, Silverman BW (2005) Functional data analysis. Springer, New York
Reiss PT, Huang L, Mennes M (2010) Fast function-on-scalar regression with penalized basis expansions. Int J Biostat 6:28
Sang P, Cao J (2020) Functional single-index quantile regression models. Stat Comput 30:771–781
Schumaker L (1981) Spline functions: basic theory. Wiley, New York
Sievers GL, Abebe A (2004) Rank estimation of regression coefficients using iterated reweighted least squares. J Stat Comput Simul 74:821–831
Smith SM, Jenkinson M, Johansen-Berg H, Rueckert D, Nichols TE et al (2006) Tract-based spatial statistics: voxelwise analysis of multi-subject diffusion data. Neuroimage 31:1487–1505
Stone CJ (1982) Optimal rates of convergence for nonparametric estimators. Ann Stat 8:1348–1360
Sun J, Lin L (2014) Local rank estimation and related test for varying-coefficient partially linear models. J Nonparametr Stat 26:187–206
Wang L, Li R (2009) Weighted Wilcoxon-type smoothly clipped absolute deviation method. Biometrics 65:564–571
Wang L, Kai B, Li R (2009) Local rank inference for varying coefficient models. J Am Stat Assoc 488:1631–1645
Wolberg G, Alfy I (2002) An energy-minimization framework for monotonic cubic spline interpolation. J Comput Appl Math 143:145–188
Yang J, Yang H, Lu F (2019) Rank-based shrinkage estimation for identification in semiparametric additive models. Stat Pap 60:1255–1281
Yu D, Kong L, Mizera I (2016) Partial functional linear quantile regression for neuroimaging data analysis. Neurocomputing 195:74–87
Zhang J, Chen J (2007) Statistical inferences for functional data. Ann Stat 35:1052–1079
Zhang F, Li R, Lian H, Bandyopadhyay D (2021) Sparse reduced-rank regression for multivariate varying-coefficient models. J Stat Comput Simul 91:752–767
Zhu H, Li R, Kong L (2012) Multivariate varying coefficient model for functional responses. Ann Stat 40:2634–2666
Acknowledgements
Mingtao Zhao’s research was supported by the University Social Science Research Project of Anhui Province (Grant No. SK2020A0051) and the Anhui Provincial Philosophy and Social Science Project (Grant No. AHSKF2022D08). Ning Li’s research was supported by the Anhui Natural Science Foundation project (Grant No. 2108085QA16) and the Anhui University Natural Science Research Project (Grant No. KJ2021A0997). Jing Yang’s research was supported by the Natural Science Foundation of Hunan Province (Grant No. 2022JJ30368) and the Scientific Research Fund of Hunan Provincial Education Department (Grant No. 22A0040).
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Appendix: Technical proofs
Appendix: Technical proofs
We denote the cdf and pdf of \(\varepsilon _{ijk}=\varepsilon _{ik}-\varepsilon _{jk}\) as \(G_k\) and \(g_k\) respectively. By simple algebra operation we can get \(g_k(\nu )=\int f_k(t)f_k(t-\nu )dt\). By the Corollary 6.21 of Schumaker (1981), there exists a vector \({\varvec{\lambda }}_0=({\varvec{\lambda }}^T_{01},\ldots ,{\varvec{\lambda }}^T_{0p})^T\) such that \(\Vert {\varvec{B}}^T(s){\varvec{\lambda }}_{0\ell }-\beta _{0\ell }(s)\Vert =O_{p}({J^{-r}_n})\) for \(\ell =1,\ldots ,p\). In the proofs, C denotes a generic constant that might assume different values at different places.
Proof of Theorem 1
Let \(\delta _n=n^{-r/(2r+1)}\), \({\varvec{u}}_n=\delta ^{-1}_n({\varvec{\lambda }}-{\varvec{\lambda }}_0)\) with \({\varvec{u}}_{n\ell }=\delta ^{-1}_n({\varvec{\lambda }}_\ell -{\varvec{\lambda }}_{0\ell })\). We want to show that for any given \(\eta >0\), there exists a large constant C such that, for large n we have
This implies that, with probability tending to one, there is local minimum \(\hat{{\varvec{\lambda }}}\) in the ball \(\{{\varvec{\lambda }}_0+\delta _{n}{\varvec{u}}_{n},\Vert {\varvec{u}}_{n}\Vert \le C\}\) such that \(\Vert \hat{{\varvec{\lambda }}}-{\varvec{\lambda }}_{0}\Vert =O_{p}(\delta _{n})\).
Let \(D_n={\mathcal {L}}_{n}({\varvec{\lambda }}_{0}+\delta _{n} {\varvec{u}}_{n})-{\mathcal {L}}_{n}({\varvec{\lambda }}_{0})\), then
where \(R_{ijk}={\varvec{\Pi }}^T_{ijk}{\varvec{\lambda }}_{0}-{\varvec{Z}}^T_i{\varvec{\beta }}_0(s_k)+{\varvec{Z}}^T_j{\varvec{\beta }}_0(s_k)\) with \({\varvec{\Pi }}_{ijk}={\varvec{\Pi }}_{ik}-{\varvec{\Pi }}_{jk}\).
By knight’s identity (Knight 1998),
we have
For \(I_1\), we have
where \(R(\varepsilon _{ik})\) is the rank statistic of \(\varepsilon _{ik}\), \(I_{11}= W^T_n{\varvec{u}}_{n}\) with \(W_n=-\frac{1}{n} \delta _{n}\sum _{i=1}^{n} {\varvec{\Pi }}_{ik}^{T}\{2 R(\varepsilon _{ik})-(n+1)\}\). By the independence between \({\varvec{Z}}_i\) and \(\varepsilon _{ik}\) and condition (C1), \({\text {E}}(W_n)=0\), \( {\text {Cov}}(W_n)=\frac{1}{n^2}\delta ^2_{n}{\varvec{\Pi }}^T_{k} {\text {Cov}}(\zeta _k){\varvec{\Pi }}_{k}\) for \({\varvec{\Pi }}_{k}=(\Pi _{1k},\ldots ,\Pi _{nk})^T\) and \(\zeta _k=(2 R(\varepsilon _{1k})-(n+1), \ldots , 2 R(\varepsilon _{nk})-(n+1))^{T}\). According to the Theorem 1 of Leng (2010), the diagonal terms of \(\frac{1}{n^2} {\text {Cov}}(\zeta )\) are
and its off-diagonal terms are
Thus, we have \({\text {E}}(I_{11})=0\) and \({\text {Var}}(I_{11})=\delta _{n}^{2} {\varvec{u}}^{T}_{n} {\varvec{\Pi }}^T_{k} n^{-2} {\text {Cov}}(\zeta ) {\varvec{\Pi }}_{k} {\varvec{u}}_{n} \rightarrow \frac{1}{3} \delta _{n}^{2} {\varvec{u}}^{T}_{n} {\varvec{\Pi }}^T_{k}{\varvec{\Pi }}_{k} {\varvec{u}}_{n}\).
Applying the Markov inequality, we have
where \(\lambda _{\max }(M)\) denotes the maximum eigenvalue of a positive definite matrix M. Hence, \(I_{11}=o_{p}(n \delta _{n}^{2})\Vert {\varvec{u}}_{n}\Vert \).
On the other hand, based on the independence between \({\varvec{Z}}_i\) and \(\varepsilon _{ik}\) and condition (C1), it follows that \({\text {E}}(I_{12})=0\). By the similar arguments of Theorem 1(a) of Noh et al. (2012), we have
Therefore, we have \(I_{12}=O_p(n \delta _{n}^{2})\Vert {\varvec{u}}_{n}\Vert \).
Next, use condition (C1) and the Markov inequality, taking the similar arguments as in Guo et al. (2017), it can be shown that
Thus,
By conditions (C1) and (C4), and the Lebesgue’s dominated convergence theorem, we have
Here we use the (13) in the third step.
By noting that
Again by (13), we have
Therefor, we have \(I_2=\frac{1}{2n} g_k(0)\delta _{n}^{2}{\varvec{u}}^T_{n}{\text {E}}(\sum _{i, j} {\varvec{\Pi }}_{ijk} {\varvec{\Pi }}^T_{ijk}) {\varvec{u}}_{n}+o_p(\sqrt{n \delta _{n}^{2}})\Vert {\varvec{u}}_{n}\Vert \). It follows from the above analysis that the \(D_n\) in (10) is dominated by the positive quadratic term of \(I_2\) as long as \(\Vert {\varvec{u}}_{n}\Vert \) is large enough. Hence (9) holds. In particular, this implies that \(\Vert {\hat{\beta }}_{\ell }(s)-\beta _{0\ell }(s)\Vert =O_{p}(n^{-r /(2 r+1)})\) for \(\ell =1,2,\ldots ,p\). We complete the proof of Theorem 1.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Sun, J., Zhao, M., Li, N. et al. Rank estimation for the function-on-scalar model. Comput Stat 39, 1807–1823 (2024). https://doi.org/10.1007/s00180-023-01414-9
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00180-023-01414-9