Skip to main content
Log in

Testing Regression Coefficients in High-Dimensional and Sparse Settings

  • Published:
Acta Mathematica Sinica, English Series Aims and scope Submit manuscript

Abstract

In the high-dimensional setting, this article considers a canonical testing problem in multivariate analysis, namely testing coefficients in linear regression models. Several tests for high-dimensional regression coefficients have been proposed in the recent literature. However, these tests are based on the sum of squares type statistics, that perform well under the dense alternatives and suffer from low power under the sparse alternatives. In order to attack this issue, we introduce a new test statistic which is based on the maximum type statistic and magnifies the sparse signals. The limiting null distribution of the test statistic is shown to be the extreme value distribution of type I and the power of the test is analysed. In particular, it is shown theoretically and numerically that the test is powerful against sparse alternatives. Numerical studies are carried out to examine the numerical performance of the test and to compare it with other tests available in the literature.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  1. Anderson, T. W.: An Introduction to Multivariate Statistical Analysis, 3rd Ed., Wiley, New York, 2003

    MATH  Google Scholar 

  2. Arias-Castro, E., Candès, E., Plan, Y.: Global testing under sparse alternatives: ANOVA, multiple comparisons and the higher criticism. Ann. Statist., 39, 2533–2556 (2011)

    MathSciNet  MATH  Google Scholar 

  3. Bien, J., Tibshirani, R.: Sparse estimation of a covariance matrix. Biometrika, 98, 807–820 (2011)

    Article  MathSciNet  Google Scholar 

  4. Cai, T., Liu, W. D.: Adaptive thresholding for sparse covariance matrix estimation. J. Amer. Statist. Assoc., 106, 672–684 (2011)

    Article  MathSciNet  Google Scholar 

  5. Cai, T., Liu, W. D., Xia, Y.: Two-sample test of high dimensional means under dependence. J. R. Stat. Soc. Ser. B Stat. Methodol., 76, 349–372 (2014)

    Article  MathSciNet  Google Scholar 

  6. Cai, T., Xia, Y.: High-dimensional sparse MANOVA. J. Multivariate Anal., 131, 174–196 (2014)

    Article  MathSciNet  Google Scholar 

  7. Chernozhukov, V., Chetverikov, D., Kato, K.: Gaussian approximations and multiplier bootstrap for maxima of sums of high-dimensional random vectors. Ann. Statist., 41, 2786–2819 (2013)

    Article  MathSciNet  Google Scholar 

  8. Cui, H. J., Guo, W. W., Zhong, W.: Test for high-dimensional regression coefficients using refitted cross-validation variance estimation. Ann. Statist., 3, 958–988 (2018)

    MathSciNet  MATH  Google Scholar 

  9. Fan, J. Q., Feng, Y., Song, R.: Nonparametric independence screening in sparse ultra-high dimensional additive models. J. Amer. Statist. Assoc., 106, 544–557 (2011)

    Article  MathSciNet  Google Scholar 

  10. Fan, J. Q., Guo, S. J., Hao, N.: Variance estimation using refitted cross-validation in ultrahigh dimensional regression. J. R. Stat. Soc. Ser. B Stat. Methodol., 74, 37–65 (2012)

    Article  MathSciNet  Google Scholar 

  11. Fan, J. Q., Lv, J. C.: Sure independence screening for ultrahigh dimensional feature space (with discussion). J. R. Stat. Soc. Ser. B Stat. Methodol., 70, 849–911 (2008)

    Article  MathSciNet  Google Scholar 

  12. Feng, L., Zou, C. L., Wang, Z. J., et al.: Rank-based score tests for high-dimensional regression coefficients. Electron. J. Stat., 7, 2131–2149 (2013)

    Article  MathSciNet  Google Scholar 

  13. Hall, P., Jin, J. S.: Properties of higher criticism under strong dependence. Ann. Statist., 36, 381–402 (2008)

    MathSciNet  MATH  Google Scholar 

  14. Hall, P., Jin, J. S.: Innovated higher criticism for detecting sparse signals in correlated noise. Ann. Statist., 38, 1686–1732 (2010)

    MathSciNet  MATH  Google Scholar 

  15. Li, R. Z., Zhong, W., Zhu, L. P.: Feature screening via distance correlation learning. J. Amer. Statist. Assoc., 107, 1129–1139 (2012)

    Article  MathSciNet  Google Scholar 

  16. Goeman, J., Van De Geer, S., Finos, L.: Testing against a high dimensional alternative in the generalized linear model: asymptotic type I error control. Biometrika, 98, 381–390 (2011)

    Article  MathSciNet  Google Scholar 

  17. Guo, B., Chen, S. X.: Tests for high dimensional generalized linear models. J. R. Stat. Soc. Ser. B Stat. Methodol., 5, 1079–1102 (2016)

    Article  MathSciNet  Google Scholar 

  18. Scheetz, T. E., Kim, K. Y. A., Swiderski, R. E., et al.: Regulation of gene expression in the mammalian eye and its relevance to eye disease. Proc. Natl. Acad. Sci., 103, 14429–14434 (2006)

    Article  Google Scholar 

  19. Van de Geer, S., Bühlmann, P., Ritov, Y., et al.: On asymptotically optimal confidence regions and tests for high-dimensional models. Ann. Statist., 42, 1166–1202 (2014)

    Article  MathSciNet  Google Scholar 

  20. Zaïtsev, A. Yu: On the gaussian approximation of convolutions under multidimensional analogues of S.N. Bernsteins inequality conditions. Probab. Theory Related Fields, 74, 535–566 (1987)

    Article  MathSciNet  Google Scholar 

  21. Zhong, P., Chen, S.: Tests for high dimensional regression coefficients with factorial designs. J. Amer. Statist. Assoc., 106, 260–274 (2011)

    Article  MathSciNet  Google Scholar 

Download references

Acknowledgements

We appreciate the constructive suggestions from the referees and the editors.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Kai Xu.

Additional information

Supported by the National Natural Science Foundation of China (Grant No. 11901006), the Natural Science Foundation of Anhui Province (Grant No. 1908085QA06) and the Talent Foundation of Anhui Normal University (Grant No. 751811)

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Xu, K., Tian, Y. & Cheng, Q. Testing Regression Coefficients in High-Dimensional and Sparse Settings. Acta. Math. Sin.-English Ser. 37, 1513–1532 (2021). https://doi.org/10.1007/s10114-021-9468-8

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10114-021-9468-8

Keywords

MR(2010) Subject Classification

Navigation