Abstract
In the high-dimensional setting, this article considers a canonical testing problem in multivariate analysis, namely testing coefficients in linear regression models. Several tests for high-dimensional regression coefficients have been proposed in the recent literature. However, these tests are based on the sum of squares type statistics, that perform well under the dense alternatives and suffer from low power under the sparse alternatives. In order to attack this issue, we introduce a new test statistic which is based on the maximum type statistic and magnifies the sparse signals. The limiting null distribution of the test statistic is shown to be the extreme value distribution of type I and the power of the test is analysed. In particular, it is shown theoretically and numerically that the test is powerful against sparse alternatives. Numerical studies are carried out to examine the numerical performance of the test and to compare it with other tests available in the literature.
Similar content being viewed by others
References
Anderson, T. W.: An Introduction to Multivariate Statistical Analysis, 3rd Ed., Wiley, New York, 2003
Arias-Castro, E., Candès, E., Plan, Y.: Global testing under sparse alternatives: ANOVA, multiple comparisons and the higher criticism. Ann. Statist., 39, 2533–2556 (2011)
Bien, J., Tibshirani, R.: Sparse estimation of a covariance matrix. Biometrika, 98, 807–820 (2011)
Cai, T., Liu, W. D.: Adaptive thresholding for sparse covariance matrix estimation. J. Amer. Statist. Assoc., 106, 672–684 (2011)
Cai, T., Liu, W. D., Xia, Y.: Two-sample test of high dimensional means under dependence. J. R. Stat. Soc. Ser. B Stat. Methodol., 76, 349–372 (2014)
Cai, T., Xia, Y.: High-dimensional sparse MANOVA. J. Multivariate Anal., 131, 174–196 (2014)
Chernozhukov, V., Chetverikov, D., Kato, K.: Gaussian approximations and multiplier bootstrap for maxima of sums of high-dimensional random vectors. Ann. Statist., 41, 2786–2819 (2013)
Cui, H. J., Guo, W. W., Zhong, W.: Test for high-dimensional regression coefficients using refitted cross-validation variance estimation. Ann. Statist., 3, 958–988 (2018)
Fan, J. Q., Feng, Y., Song, R.: Nonparametric independence screening in sparse ultra-high dimensional additive models. J. Amer. Statist. Assoc., 106, 544–557 (2011)
Fan, J. Q., Guo, S. J., Hao, N.: Variance estimation using refitted cross-validation in ultrahigh dimensional regression. J. R. Stat. Soc. Ser. B Stat. Methodol., 74, 37–65 (2012)
Fan, J. Q., Lv, J. C.: Sure independence screening for ultrahigh dimensional feature space (with discussion). J. R. Stat. Soc. Ser. B Stat. Methodol., 70, 849–911 (2008)
Feng, L., Zou, C. L., Wang, Z. J., et al.: Rank-based score tests for high-dimensional regression coefficients. Electron. J. Stat., 7, 2131–2149 (2013)
Hall, P., Jin, J. S.: Properties of higher criticism under strong dependence. Ann. Statist., 36, 381–402 (2008)
Hall, P., Jin, J. S.: Innovated higher criticism for detecting sparse signals in correlated noise. Ann. Statist., 38, 1686–1732 (2010)
Li, R. Z., Zhong, W., Zhu, L. P.: Feature screening via distance correlation learning. J. Amer. Statist. Assoc., 107, 1129–1139 (2012)
Goeman, J., Van De Geer, S., Finos, L.: Testing against a high dimensional alternative in the generalized linear model: asymptotic type I error control. Biometrika, 98, 381–390 (2011)
Guo, B., Chen, S. X.: Tests for high dimensional generalized linear models. J. R. Stat. Soc. Ser. B Stat. Methodol., 5, 1079–1102 (2016)
Scheetz, T. E., Kim, K. Y. A., Swiderski, R. E., et al.: Regulation of gene expression in the mammalian eye and its relevance to eye disease. Proc. Natl. Acad. Sci., 103, 14429–14434 (2006)
Van de Geer, S., Bühlmann, P., Ritov, Y., et al.: On asymptotically optimal confidence regions and tests for high-dimensional models. Ann. Statist., 42, 1166–1202 (2014)
Zaïtsev, A. Yu: On the gaussian approximation of convolutions under multidimensional analogues of S.N. Bernsteins inequality conditions. Probab. Theory Related Fields, 74, 535–566 (1987)
Zhong, P., Chen, S.: Tests for high dimensional regression coefficients with factorial designs. J. Amer. Statist. Assoc., 106, 260–274 (2011)
Acknowledgements
We appreciate the constructive suggestions from the referees and the editors.
Author information
Authors and Affiliations
Corresponding author
Additional information
Supported by the National Natural Science Foundation of China (Grant No. 11901006), the Natural Science Foundation of Anhui Province (Grant No. 1908085QA06) and the Talent Foundation of Anhui Normal University (Grant No. 751811)
Rights and permissions
About this article
Cite this article
Xu, K., Tian, Y. & Cheng, Q. Testing Regression Coefficients in High-Dimensional and Sparse Settings. Acta. Math. Sin.-English Ser. 37, 1513–1532 (2021). https://doi.org/10.1007/s10114-021-9468-8
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10114-021-9468-8