Abstract
We study a permutation procedure to test the equality of mean vectors, homogeneity of covariance matrices, or simultaneous equality of both mean vectors and covariance matrices in multivariate paired data. We propose to use two test statistics for the equality of mean vectors and the homogeneity of covariance matrices, respectively, and combine them to test the simultaneous equality of both mean vectors and covariance matrices. Since the combined test has composite null hypothesis, we control its type I error probability and theoretically prove the asymptotic unbiasedness and consistency of the combined test. The new procedure requires no structural assumption on the covariances. No distributional assumption is imposed on the data, except that the permutation test for mean vector equality assumes symmetric joint distribution of the paired data. We illustrate the good performance of the proposed approach with comparison to competing methods via simulations. We apply the proposed method to testing the symmetry of tooth size in a dental study and to finding differentially expressed gene sets with dependent structures in a microarray study of prostate cancer.
Similar content being viewed by others
References
Anderson TW (2003) An introduction to multivariate statistical analysis, 3rd edition. Wiley, New York
Aslan B, Zech G (2005) New test for the multivariate two-sample problem based on the concept of minimum energy. J Stat Comput Simul 75: 109–119
Barrett T, Suzek TO, Troup DB, Wilhite SE, Ngau WC, Ledoux P, Rudnev D, Lash AE, Fujibuchi W, Edgar R (2005) NCBI GEO: mining millions of expression profiles—database and tools. Nucleic Acids Res 33(Database issue): D562–566
Bradley E, Blackwood LG (1989) Comparing paired data: a simultaneous test for means and variances. Am Stat 43: 234–235
Chung JH, Fraser DAS (1958) Randomization tests for a multivariate two-sample problem. J Am Stat Assoc 53: 729–735
Choi SC, Wette R (1972) A test for homogeneity of variances among correlated variables. Biometrics 28: 589–592
Conover WJ, Johnson ME, Johnson MM (1981) A comparative study of tests for homogeneity of variances, with applications to the outer continental shelf bidding data. Technometrics 23: 351–361
de Leon A (2007) One-sample likelihood ratio tests for mixed data. Commun Stat A Theory Methods 36: 129–141
Efron B, Tibshirani R (2007) On testing the significance of sets of genes. Ann Appl Stat 1: 107–129
Friedman J, Hastie T, Tibshirani R (2008) Sparse inverse covariance estimation with the graphical lasso. Biostatistics 9: 432–441
Garn SM, Lewis AB, Kerewsky RS (2002) The meaning of bilateral asymmetry in the permanent dentition. Am J Orthodontics Dentofacial Orthopedics 36: 55–62
Goutis C, Casella G, Wells MT (1996) Assessing evidence in multiple hypotheses. J Am Stat Assoc 91: 1268–1277
Han CP (1968) Testing the homogeneity of a test of correlated variances. Biometrika 55: 317–326
Harris P (1985) Testing for variance homogeneity of correlated variables. Biometrika 72: 103–107
Hirotsu C (1986) Cummulative chi-squared statistic or a tool for testing goodness of fit. Biometrika 73: 165–173
Hirotsu C (1998) Max t test for analysing a dose-response relationship—an efficient algorithm for p value calculation. In: Pronzato L (ed) Volume of abstracts of MODA-5 (5th International conference on advances in model oriented data analysis and experimental design). CIRM, Marseille
Hotelling H (1931) The generalization of Student’s ratio. Ann Math Stat 2: 360–378
Kariya T (1981) A ronustness propertity of Hotelling’s T 2-test. Ann Stat 9: 211–214
Kim JY, Lee SJ, Kim TW, Nahm DS, Chang YI (2005) Classification of the skeletal variation in normal occlusion. Angle Orthodontics 75: 303–311
Lauritzen SL (2004) Graphical models. Oxford University Press, New York
Lee SJ, Lee S, Lim J, Ahn SJ, Kim TW (2007) Cluster analysis of human tooth size in subjects with normal occlusion. Am J Orthodontics Dentofacial Orthopedics 132: 796–800
Lim J, Li E, Lee SJ (2010) Likelihood ratio tests of correlated multivariate samples. Journal of Multivariate Anal 101: 541–554
Newton M, Quintana F, Den Boon J, Sengupta S, Ahlquist P (2007) Random-set methods identify distinct aspects of the enrichment signal in gene-set analysis. Ann Appl Stat 1: 85–106
Olson CL (1974) Comparative robustness of six tests in multivariate analysis of variance. J Am Stat Assoc 69: 894–908
Perlman MD (1980) Unbiasedness of the likelihood ratio tests for equality of several covariance matrices and equality of several multivariate normal populations. Ann Stat 8: 247–263
Pesarin F (2001) Multivariate permutation tests with applications in biostatistics. Wiley, New York
Srivastava MS, Awan HM (1982) On the robustness of Hotelling T 2-test and distribution of linear and quadratic forms in sampling from a mixture of two multivariate normal populations. Commun Stat Theory Methods 11: 81–107
Subramanian A, Tamayoa P, Moothaa VK, Mukherjeed S, Eberta BL, Gillettea MA, Paulovichg A, Pomeroy SL, Goluba TR, Landera ES, Mesirova JP (2005) Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. Proc Natl Acad Sci USA 102: 15545–15550
Szatrowski TH (1979) Asymptotic nonnull distributions for likelihood ratio statistics in the multivariate normal patterned mean and covariance matrix testing problem. Ann Stat 7: 823–837
Uysal T, Sari Z, Basciftci FA, Memili B (2005) Intermaxillary tooth size discrepancy and malocclusion: is there a relation?. Angle Orthodontics 75: 204–209
Yu YP, Landsittel D, Jing L, Nelson J, Ren B, Liu L, McDonald C, Thomas R, Dhir R, Finkelstein S, Michalopoulos G, Becich M, Luo JH (2004) Gene expression alterations in prostate cancer predicting tumor aggression and preceding development of malignancy 1. J Clin Oncol 22: 2790–2799
Yuan M, Lin Y (2007) Model selection and estimation in the gaussian graphical model. Biometrika 94: 19–35
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Li, E., Lim, J., Kim, K. et al. Distribution-free tests of mean vectors and covariance matrices for multivariate paired data. Metrika 75, 833–854 (2012). https://doi.org/10.1007/s00184-011-0355-7
Received:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00184-011-0355-7