Advertisement

Psychometrika

, Volume 75, Issue 4, pp 649–674 | Cite as

Tests of Homoscedasticity, Normality, and Missing Completely at Random for Incomplete Multivariate Data

  • Mortaza JamshidianEmail author
  • Siavash Jalal
Article

Abstract

Test of homogeneity of covariances (or homoscedasticity) among several groups has many applications in statistical analysis. In the context of incomplete data analysis, tests of homoscedasticity among groups of cases with identical missing data patterns have been proposed to test whether data are missing completely at random (MCAR). These tests of MCAR require large sample sizes n and/or large group sample sizes n i , and they usually fail when applied to nonnormal data. Hawkins (Technometrics 23:105–110, 1981) proposed a test of multivariate normality and homoscedasticity that is an exact test for complete data when n i are small. This paper proposes a modification of this test for complete data to improve its performance, and extends its application to test of homoscedasticity and MCAR when data are multivariate normal and incomplete. Moreover, it is shown that the statistic used in the Hawkins test in conjunction with a nonparametric k-sample test can be used to obtain a nonparametric test of homoscedasticity that works well for both normal and nonnormal data. It is explained how a combination of the proposed normal-theory Hawkins test and the nonparametric test can be employed to test for homoscedasticity, MCAR, and multivariate normality. Simulation studies show that the newly proposed tests generally outperform their existing competitors in terms of Type I error rejection rates. Also, a power study of the proposed tests indicates good power. The proposed methods use appropriate missing data imputations to impute missing data. Methods of multiple imputation are described and one of the methods is employed to confirm the result of our single imputation methods. Examples are provided where multiple imputation enables one to identify a group or groups whose covariance matrices differ from the majority of other groups.

Keywords

covariance structures k-sample test missing data multiple imputation nonparametric test structural equations test of homogeneity of covariances 

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Anderson, T.W., & Darling, D.A. (1954). A test of goodness of fit. Journal of the American Statistical Association, 49, 765–769. CrossRefGoogle Scholar
  2. Bentler, P.M., Kim, K.H., & Yuan, K.H. (2004). Testing homogeneity of covariances with infrequent missing data patterns. Unpublished Manuscript. Google Scholar
  3. David, F.N. (1939). On Neyman’s “smooth” test for goodness of fit: I. Distribution of the criterion Ψ2 when the hypothesis tested is true. Biometrika, 31, 191–199. Google Scholar
  4. Fisher, R.A. (1932). Statistical methods for research workers (4th ed.). London: Oliver & Boyd. Google Scholar
  5. Hawkins, D.M. (1981). A new test for multivariate normality and homoscedasticity. Technometrics, 23, 105–110. CrossRefGoogle Scholar
  6. Hou, C.D. (2005). A simple approximation for the distribution of the weighted combination of non-independent or independent probabilities. Statistics and Probability Letters, 73, 179–187. CrossRefGoogle Scholar
  7. Jamshidian, M., & Bentler, P.M. (1999). ML estimation of mean and covariance structures with missing data using complete data routines. Journal of Educational and Behavioral Statistics, 24, 21–41. Google Scholar
  8. Jamshidian, M., & Schott, J. (2007). Testing equality of covariance matrices when data are incomplete. Computational Statistics and Data Analysis, 51, 4227–4239. CrossRefGoogle Scholar
  9. Kim, K.H., & Bentler, P.M. (2002). Tests of homogeneity of means and covariance matrices for multivariate incomplete data. Psychometrika, 67, 609–624. CrossRefGoogle Scholar
  10. Kruskal, W., & Wallis, W. (1952). Use of ranks in one-criterion variance analysis. Journal of the American Statistical Association, 47, 583–621. CrossRefGoogle Scholar
  11. Ledwina, T. (1994). Data-driven version of Neyman’s smooth test of fit. Journal of the American Statistical Association, 89, 1000–1005. CrossRefGoogle Scholar
  12. Little, R.J.A. (1988). A test of missing completely at random for multivariate data with missing values. Journal of the American Statistical Association, 83, 1198–1202. CrossRefGoogle Scholar
  13. Little, R.J.A., & Rubin, D.B. (1987). Statistical analysis with missing data (1st edn.). New York: Wiley. Google Scholar
  14. Little, R.J.A., & Rubin, D.B. (2002). Statistical analysis with missing data (2nd edn.). New York: Wiley. Google Scholar
  15. Marhuenda, Y., Morales, D., & Pardo, M.C. (2005). A comparison of uniformity tests. Statistics, 39, 315–328. CrossRefGoogle Scholar
  16. Neyman, J. (1937). Smooth test for goodness of fit. Skandinavisk Aktuarietidskrift, 20, 150–199. Google Scholar
  17. Rayner, J.C.W., & Best, D.J. (1990). Smooth tests of goodness of fit: An overview. International Statistical Review, 58, 9–17. CrossRefGoogle Scholar
  18. Rubin, D.B. (1976). Inference and missing data. Biometrika, 63, 581–592. CrossRefGoogle Scholar
  19. Rubin, D.B. (1987). Multiple imputation for nonresponse in surveys. New York: Wiley. CrossRefGoogle Scholar
  20. Scholz, F.W., & Stephens, M.A. (1987). K-sample Anderson-Darling tests. Journal of the American Statistical Association, 82, 918–924. CrossRefGoogle Scholar
  21. Srivastava, M.S. (2002). Methods of multivariate statistics. New York: Wiley. Google Scholar
  22. Srivastava, M.S., & Dolatabadi, M. (2009). Multiple imputation and other resampling scheme for imputing missing observations. Journal of Multivariate Analysis, 100, 1919–1937. CrossRefGoogle Scholar
  23. Thas, O., & Ottoy, J.-P. (2004). An extension of the Anderson-Darling k-sample test to arbitrary sample space partition sizes. Journal of Statistical Computation and Simulation, 74, 651–665. CrossRefGoogle Scholar
  24. Yuan, K.H. (2009). Normal distribution pseudo ML for missing data: With applications to mean and covariance structure analysis. Journal of Multivariate Analysis, 100, 1900–1918. CrossRefGoogle Scholar
  25. Yuan, K.H., Bentler, P.M., & Zhang, W. (2005). The effect of skewness and kurtosis on mean and covariance structure analysis: The univariate case and its multivariate implication. Sociological Methods & Research, 34, 240–258. CrossRefGoogle Scholar

Copyright information

© The Psychometric Society 2010

Authors and Affiliations

  1. 1.Department of MathematicsCalifornia State UniversityFullertonUSA

Personalised recommendations