Analysis of unbalanced mixed model data: A case study comparison of ANOVA versus REML/GLS

Littell, Ramon C.

doi:10.1198/108571102816

Analysis of unbalanced mixed model data: A case study comparison of ANOVA versus REML/GLS

Editor’s Invited Paper
Published: December 2002

Volume 7, pages 472–490, (2002)
Cite this article

Journal of Agricultural, Biological, and Environmental Statistics Aims and scope Submit manuscript

Ramon C. Littell¹

1552 Accesses
79 Citations
Explore all metrics

Abstract

Major transition has occurred in recent years in statistical methods for analysis of linear mixed model data from analysis of variance (ANOVA) to likelihood-based methods. Prior to the early 1990s, most applications used some version of analysis of variance because computer software was either not available or not easy to use for likelihood-based methods. ANOVA is based on ordinary least squares computations, with adoptions for mixed models. Computer programs for such methodology were plagued with technical problems of estimability, weighting, and handling missing data. Likelihood-based methods mainly use a combination of residual maximum likelihood (REML) estimation of covariance parameters and generalized least squares (GLS) estimation of mean parameters. Software for REML/GLS methods became readily available early in the 1990s, but the methodology still is not universally embraced. Although many of the computational inadequacies have been overcome, conceptual problems remain. Also, technical problems with REML/GLS have emerged, such as the need for adjustments for effects due to estimating covariance parameters. This article attempts to identify the major problems with ANOVA, describe the problems which remain with REML/GLS, and discuss new problems with REML/GLS.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

References

Damon, R. A., Jr., and Harvey, W. R. (1987), Experimental Design, ANOVA, and Regression, New York: Harper & Row.
Google Scholar
Fai, A. H. T., and Cornelius, P. L. (1996), “Approximate F-tests of Multiple Degree of Freedom Hypotheses in Generalized Least Squares Analysis of Unbalanced Split-Plot Experiments,” Journal of Statistical Computation and Simulation, 54, 363–378.
Article MATH MathSciNet Google Scholar
Giesbrecht, F. G., and Burns, J. C. (1985), “Two-Stage Analysis Based on a Mixed Model: Large-Sample Theory and Small-Sample Simulation Results,” Biometrics, 41, 477–486.
Article MATH Google Scholar
Guerin, L., and Stroup, W. W. (2000), “A Simulation Study to Evaluate PROC MIXED Analysis of Repeated Measures Data,” in Proceedings of the 12th Annual Conference on Applied Statistics in Agriculture, Manhattan, KS: Kansas State University.
Google Scholar
Harvey, W. R. (1982), “Mixed Model Capabilities of LSML76,” Journal of Animal Science, 54, 1279–1285.
Google Scholar
Harville, D. A., and Jeske, D. R. (1992), “Mean Squared Error of Prediction Under a General Linear Model,” Journal of the American Statistical Association, 87, 724–731.
Article MATH MathSciNet Google Scholar
Henderson, C. R. (1975), “Best Linear Estimation and Prediction Under a Selection Model,” Biometrics, 31, 423–449.
Article MATH Google Scholar
— (1984), “Applications of Linear Models in Animal Breeding,” Guelph, Ontario: University of Guelph.
Google Scholar
Jeske, D. R., and Harville, D. A. (1988), “Prediction-Interval Procedures and (Fixed Effect) Confidence-Interval Procedures for Mixed Linear Models,” Communications in Statistics: Theory and Methods, 17 1053–1088.
Article MATH MathSciNet Google Scholar
Kackar, R. N., and Harville, D. A. (1984), “Approximations for Standard Errors of Estimators of Fixed nad Random Effects in Mixed Linear Models,” Journal of the American Statistical Association, 79, 853–862.
Article MATH MathSciNet Google Scholar
Kenward, M. G., and Roger, J. H. (1997), “Small Sample Inference for Fixed Effects from Restricted Maximum Likelihood,” Biometrics, 53, 983–997.
Article MATH Google Scholar
Little, R. J. A., and Rubin, D. B. (1987), Statistical Analysis with Missing Data, New York: Wiley.
MATH Google Scholar
Littell, R. C., Milliken, G. A., Stroup, W. W., and Wolfinger, R. D. (1996), SAS System for Mixed Models, Cary, NC: SAS Institute, Inc.
Google Scholar
Littell, R. C., Pendergast, J., and Natarajan, R. (2000), “Modelling Covariance Structure in the Analysis of Repeated Measures Data,” Statistics in Medicine, 19, 1793–1819.
Article Google Scholar
Littell, R. C., Stroup, W. W., and Freund, R. J. (2002), SAS for Linear Models (4th ed.), Cary, NC: SAS Institute, Inc.
Google Scholar
Milliken, G. A., and Johnson, D. E. (1992), Analysis of Messy Data, Volume 1: Designed Experiments, New York: Chapman and Hall.
Google Scholar
Prasad, N. G. N., and Rao, J. N. K. (1990), “The Estimation of Mean Squared Error of Small-Area Estimators,” Journal of the American Statistical Association, 85, 163–171.
Article MATH MathSciNet Google Scholar
Puntanen, S., and Styan, G. P. H. (1989), “The Equality of the Ordinary Leas Squares Estimator and the Best Linear Unbiased Estimator,” The American Statistician, 43, 153–164.
Article MathSciNet Google Scholar
Rawlings, J. O., Pantula, D. A., and Dickey, D. A. (1998), Applied Regression Analysis New York: Springer.
Book MATH Google Scholar
Remenga, M. D., and Johnson, D. E. (1995), “A Comparison of Inference Procedures in Unbalanced Split-Plot Designs,” Journal of Statistical Computation and Simulation, 51, 353–367.
Article MathSciNet Google Scholar
Satterthwaite, F. W. (1946), “An Approximate Distribution of Estimates of Variance Components,” Biometrics Bulletin, 2, 110–114.
Article Google Scholar
Self, S. G., and Liang, K-Y. (1987), “Asymptotic Properties of Maximum Likelihood Estimators and Likelihood Ratio Tests Under Nonstandard Conditions,” Journal of the American Statistical Association, 82, 605–610.
Article MATH MathSciNet Google Scholar
Speed, F. M., Hocking, R. R., and Hackney, O. P. (1978), “Methods for Analysis of Linear Models with Unbalanced Data,” Journal of the American Statistical Association, 73, 105–112.
Article MATH Google Scholar
Searle, S. R. (1987), Linear Models for Unbalanced Data, New York: Wiley.
Google Scholar
Searle, S. R., Casella, G., and McCulloch, C. E. (1992), Variance Components, New York: Wiley.
Book MATH Google Scholar

Download references

Author information

Authors and Affiliations

Department of Statistics, Institute of Food and Agricultural Sciences, University of Florida, Box 110339, 32611, Gainesville, FL
Ramon C. Littell (Professor)

Authors

Ramon C. Littell
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Ramon C. Littell.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Littell, R.C. Analysis of unbalanced mixed model data: A case study comparison of ANOVA versus REML/GLS. JABES 7, 472–490 (2002). https://doi.org/10.1198/108571102816

Download citation

Issue Date: December 2002
DOI: https://doi.org/10.1198/108571102816

Key Words

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Analysis of unbalanced mixed model data: A case study comparison of ANOVA versus REML/GLS

Abstract

Access this article

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Key Words

Search

Navigation