Skip to main content
Log in

A high-dimensional test on linear hypothesis of means under a low-dimensional factor model

  • Published:
Metrika Aims and scope Submit manuscript

Abstract

In this paper, the problem of testing the hypothesis of linear combination of k-sample means of high-dimensional data is investigated under a low-dimensional factor model. We propose a new test and derive that the asymptotic distribution of the test statistic is a weighted distribution of independent chi-squared distribution of 1 degree of freedom under the null hypothesis and mild conditions. We provide numerical studies on both sizes and powers to illustrate performance of the proposed test.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7

Similar content being viewed by others

References

  • Ahn SC, Horenstein AR (2013) Eigenvalue ratio test for the number of factors. Econometrica 81(3):1203–1227

    Article  MathSciNet  Google Scholar 

  • Bai Z, Saranadasa H (1996) Effect of high dimension: by an example of a two sample problem. Statist Sinica 6(2):311–329

    MathSciNet  MATH  Google Scholar 

  • Birnbaum A, Johnstone IM, Nadler B, Paul D (2013) Minimax bounds for sparse PCA with noisy high-dimensional data. Ann Stat 41(3):1055–1084

    Article  MathSciNet  Google Scholar 

  • Box GEP (1954) Some theorems on quadratic forms applied in the study of analysis of variance problems. I. effect of inequality of variance in the one-way classification. Ann Math Stat 25:290–302

    Article  MathSciNet  Google Scholar 

  • Cai T, Ma Z, Wu Y (2013) Sparse PCA: optimal rates and adaptive estimation. Ann Stat 41(6):3074–3110

    Article  MathSciNet  Google Scholar 

  • Cao M, Sun P et al (2020) A test on linear hypothesis of \(k\)-sample means in high-dimensional data. Stat Interface 13(1):27–36

    Article  MathSciNet  Google Scholar 

  • Chen S, Qin Y (2010) A two-sample test for high-dimensional data with applications to gene-set testing. Ann Stat 38(2):808–835

    Article  MathSciNet  Google Scholar 

  • Chen L, Paul D, Prentice RL, Wang P (2011) A regularized Hotellings \(T^{2}\) test for pathway analysis in proteomic studies. J Amer Stat Assoc 106(496):1345–1360

    Article  Google Scholar 

  • Chen S, Zhang L, Zhong P (2010) Tests for high-dimensional covariance matrices. J Amer Stat Assoc 105(490):810–819

    Article  MathSciNet  Google Scholar 

  • Fujikoshi Y, Himeno T, Wakaki H (2004) Asymptotic results of a high dimensional MANOVA test and power comparison when the dimension is large compared to the sample size. J Japan Stat Soc 34(1):19–26

    Article  MathSciNet  Google Scholar 

  • Hyodo M (2017) Tests for the parallelism and flatness hypotheses of multi-group profile analysis for high-dimensional elliptical populations. J Multiv Anal 162:82–92

    Article  MathSciNet  Google Scholar 

  • Hyodo M, Nishiyama T (2018) A simultaneous testing of the mean vector and the covariance matrix among two populations for high-dimensional data. TEST 27(3):680–699

    Article  MathSciNet  Google Scholar 

  • Hyodo M, Nishiyama T (2021) Simultaneous testing of the mean vector and covariance matrix among \(k\) populations for high-dimensional data. Commun Stat Theory Methods 50(3):663–684

    Article  MathSciNet  Google Scholar 

  • Hyodo M, Nishiyama T, Pavlenko T (2020) On error bounds for high-dimensional asymptotic distribution of \(L_2\)-type test statistic for equality of means. Stat Probab Lett 157:108637

    Article  Google Scholar 

  • Hyodo M, Takahashi S, Nishiyama T (2014) Multiple comparisons among mean vectors when the dimension is larger than the total sample size. Commun Stat Simulation Comput 43(10):2283–2306

    Article  MathSciNet  Google Scholar 

  • Hyodo M, Watanabe H, Seo T (2018) On simultaneous confidence interval estimation for the difference of paired mean vectors in high-dimensional settings. J Multiv Anal 168:160–173

    Article  MathSciNet  Google Scholar 

  • Jiang D (2017) Likelihood-based tests on moderate-high-dimensional mean vectors with unequal covariance matrices. J Korean Stat Soc 46(3):451–461

    Article  MathSciNet  Google Scholar 

  • Lam C, Yao Q (2012) Factor modeling for high-dimensional time series: inference for the number of factors. Ann Stat 40(2):694–726

    Article  MathSciNet  Google Scholar 

  • Li H, Hu J et al (2017) Test on the linear combinations of mean vectors in high-dimensional data. TEST 26(1):188–208

    Article  MathSciNet  Google Scholar 

  • Ma Y, Lan W, Wang H (2015) A high dimensional two-sample test under a low dimensional factor structure. J Multiv Anal 140:162–170

    Article  MathSciNet  Google Scholar 

  • Nishiyama T, Hyodo M et al (2013) Testing linear hypotheses of mean vectors for high dimension data with unequal covariance matrices. J Stat Plan Inference 143(11):1898–1911

    Article  MathSciNet  Google Scholar 

  • Passemier D, Li Z, Yao J (2017) On estimation of the noise variance in high dimensional probabilistic principal component analysis. J R Stat Soc Ser B Stat Methodol 79(1):51–67

    Article  MathSciNet  Google Scholar 

  • Satterthwaite FE (1941) Synthesis of variance. Psychometrika 6:309–316

    Article  MathSciNet  Google Scholar 

  • Thulin M (2014) A high-dimensional two-sample test for the mean using random subspaces. Comput Stat Data Anal 74:26–38

    Article  MathSciNet  Google Scholar 

  • Wang H (2012) Factor profiled sure independence screening. Biometrika 99(1):15–28

    Article  MathSciNet  Google Scholar 

  • Wang R, Xu X (2018) On two-sample mean tests under spiked covariances. J Multiv Anal 167:225–249

    Article  MathSciNet  Google Scholar 

  • Watanabe H, Hyodo M, Nakagawa S (2020) Two-way MANOVA with unequal cell sizes and unequal cell covariance matrices in high-dimensional settings. J Multiv Anal 179:104625

    Article  MathSciNet  Google Scholar 

  • Welch BL (1947) The generalization of ‘Student’s’ problem when several different population variances are involved. Biometrika 34:28–35

  • Zhang J, Guo J, Zhou B (2017) Linear hypothesis testing in high-dimensional one-way MANOVA. J Multiv Anal 155:200–216

    Article  MathSciNet  Google Scholar 

  • Zhang J, Guo J, Zhou B, Cheng M (2020) A simple two-sample test in high dimensions based on \(L^2\)-norm. J Amer Stat Assoc 115(530):1011–1027

    Article  Google Scholar 

  • Zhou B, Guo J, Zhang J (2017) High-dimensional general linear hypothesis testing under heteroscedasticity. J Stat Plan Inference 188:36–54

    Article  MathSciNet  Google Scholar 

Download references

Acknowledgements

The authors thank the Editor-in-Chief, Professor Maria Kateri, an associated editor and two anonymous referees for their constructive comments, suggestions and detailed advice that vastly improved this article. We also express our thanks to JEO assistant, Ms. Sangamithrai for her help in the process of revision of our paper. Cao’s research is supported by the National Statistical Science Research Program (No. 2020LY002), the National Natural Science Foundation of China (Nos. 11601008, 11526070) and Doctor Startup Foundation of Anhui Normal University (No. 2016XJJ101).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Mingxiang Cao.

Ethics declarations

Conflict of interest

On behalf of author, the corresponding author states that there is no conflict of interest.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Cao, M., He, Y. A high-dimensional test on linear hypothesis of means under a low-dimensional factor model. Metrika 85, 557–572 (2022). https://doi.org/10.1007/s00184-021-00841-2

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00184-021-00841-2

Keywords

Mathematics Subject Classification

Navigation