Skip to main content
Log in

Maximum Likelihood Methods in Treating Outliers and Symmetrically Heavy-Tailed Distributions for Nonlinear Structural Equation Models with Missing Data

  • Published:
Psychometrika Aims and scope Submit manuscript

Abstract

By means of more than a dozen user friendly packages, structural equation models (SEMs) are widely used in behavioral, education, social, and psychological research. As the underlying theory and methods in these packages are vulnerable to outliers and distributions with longer-than-normal tails, a fundamental problem in the field is the development of robust methods to reduce the influence of outliers and the distributional deviation in the analysis. In this paper we develop a maximum likelihood (ML) approach that is robust to outliers and symmetrically heavy-tailed distributions for analyzing nonlinear SEMs with ignorable missing data. The analytic strategy is to incorporate a general class of distributions into the latent variables and the error measurements in the measurement and structural equations. A Monte Carlo EM (MCEM) algorithm is constructed to obtain the ML estimates, and a path sampling procedure is implemented to compute the observed-data log-likelihood and then the Bayesian information criterion for model comparison. The proposed methodologies are illustrated with simulation studies and an example.

This is a preview of subscription content, log in via an institution to check access.

Access this article

We’re sorry, something doesn't seem to be working properly.

Please try refreshing the page. If that doesn't work, please contact support so we can address the problem.

Similar content being viewed by others

References

  • Bentler, P.M. (2004). EQS6: Structural equations program manual. Encino, CA: Multivariate Software.

    Google Scholar 

  • Berkane, M., & Bentler, P.M. (1988). Estimating of the contamination parameters and identification of outliers in multivariate data. Sociological Methods & Research, 17, 55–64.

    Article  Google Scholar 

  • Bowman, K.O., & Shenton, L.R. (1988). Properties of estimators for the gamma distribution. New York: Marcel Dekker.

    Google Scholar 

  • Browne, M.W. (1987). Robustness of statistical influence in factor analysis and related models. Biometrika, 74, 375–384.

    Article  Google Scholar 

  • Browne, M.W., & Shapiro, A. (1988). Robustness of normal theory methods in the analysis of linear latent variable models. British Journal of Mathematical and Statistical Psychology, 41, 193–208.

    Google Scholar 

  • Campbell, N.A. (1982). Robust procedure in multivariate analysis I: Robust covariance estimation. Applied Statistics, 29, 231–237.

    Article  Google Scholar 

  • Cowles, M.K. (1996). Accelerating Monte Carlo Markov Chains convergence for cumulative-link generalized linear modes. Statistics and Computing, 6, 101–111.

    Article  Google Scholar 

  • Dempster, A.P., Laird, N.M., & Rubin, D.B. (1977). Maximum likelihood from incomplete data via the EM algorithm (with discussion). Journal of the Royal Statistical Society, Series B, 39, 1–38.

    Google Scholar 

  • Gelman, A., & Meng, X.L. (1998). Simulating normalizing constants: From importance sampling to bridge sampling to path sampling. Statistical Science, 13, 163–185.

    Article  Google Scholar 

  • Geman, S., & Geman, D. (1984). Stochastic relaxation, Gibbs distribution, and the Bayesian restoration of images. IEEE Transactions on Pattern Analysis and Machine Intelligence, 6, 721–741.

    Article  Google Scholar 

  • Hastings, W.K. (1970). Monte Carlo sampling methods using Markov chains and their applications. Biometrika, 57, 97–100.

    Article  Google Scholar 

  • Jöreskog, K.G., & Sörbom, D. (1996). LISREL 8: Structural equation modelling with the SIMPLIS command language. London: Scientific Software International.

    Google Scholar 

  • Kano, Y., Berkane, M., & Bentler, P.M. (1993). Statistical inference based on pseudo-maximum likelihood estimators in elliptical populations. Journal of the American Statistical Association, 88, 135–143.

    Article  Google Scholar 

  • Kass, R.E., & Raftery, A.E. (1995). Bayes factor. Journal of the American Statistical Association, 90, 773–795.

    Article  Google Scholar 

  • Lange, K.L., Little, R.J.A., & Taylor, J. M. G. (1989). Robust statistical modelling using the t-distribution. Journal of the American Statistical Association, 84, 881–896.

    Article  Google Scholar 

  • Lee, M., & Lomax, R.G. (2005). The effects of varying degrees of nonnormality in structural equation modeling. Structural Equation Modeling, 12, 1–27.

    Article  Google Scholar 

  • Lee, S.Y., & Song, X.Y. (2004). Maximum likelihood analysis of a general latent variable model with hierarchically mixed data. Biometrics, 60, 624–636.

    Article  PubMed  Google Scholar 

  • Lee, S.Y., & Song, X.Y. (2003). Model comparison of nonlinear structural equation models with fixed covariates. Psychometrika, 68, 27–47.

    Article  Google Scholar 

  • Lee, S.Y., Song, X.Y., & Lee, J.C.K. (2003). Maximum likelihood estimation of nonlinear structure models with ignorable missing data. Journal of Educational and Behavioral Statistics, 28, 111–134.

    Article  Google Scholar 

  • Lee, S.Y., & Zhu, H.T. (2002). Maximum likelihood estimation of nonlinear structural equation models. Psychometrika, 67, 189–210.

    Article  Google Scholar 

  • Little, R.J.A. (1988). Robust estimation of mean and covariance matrix from data with missing values. Applied Statistics, 37, 23–39.

    Article  Google Scholar 

  • Little, R.J.A., & Rubin, D.B. (1987). Statistical analysis with missing data. New York: Wiley.

    Google Scholar 

  • Louis, T.A. (1982). Finding the observed information matrix when using the EM algorithm. Journal of the Royal Statistical Society, Series B, 44, 226–233.

    Google Scholar 

  • Mardia, V.V. (1970). Measures of multivariate skewness and kurtosis with application. Biometrika, 57, 519–530.

    Article  Google Scholar 

  • Meng, X.L., & Rubin, D.B. (1993). Maximum likelihood estimation via the ECM algorithm: A general framework. Biometrika, 80, 267–278.

    Article  Google Scholar 

  • Metropolis, N., Rosenbluth, A.W., Rosenbluth, M.N., Teller, A.H., & Teller, E. (1953). Equations of state calculations by fast computing machines. Journal of Chemical Physics, 21, 1087–1092.

    Article  Google Scholar 

  • Ogasawara, H. (2005). Asymptotic robustness of the asymptotic bias in structural equation modeling. Computational Statistics & Data Analysis, 49, 771–783.

    Article  Google Scholar 

  • Schwarz, G. (1978). Estimating the dimension of a model. Annals of Statistics, 6, 461–464.

    Article  Google Scholar 

  • Song, X.Y., & Lee, S.Y. (2005). Maximum likelihood analysis of nonlinear structural equation models with dichotomons variables. Multivariate Behavioral Research, 40, 151–177.

    Article  Google Scholar 

  • Song, X.Y., & Lee, S.Y. (2004). Bayesian analysis of two-level nonlinear structural equation models with continuous and polytomous data. British Journal of Mathematical and Statistical Psychology, 57, 29–52.

    Article  PubMed  Google Scholar 

  • Watanabe, M., & Yamaguchi, K. (2004). The EM algorithm and relate statistical models. New York: Marcel Dekker.

    Google Scholar 

  • Wei, G.C.G., & Tanner, M.A. (1990). Monte Carlo implementation of the EM algorithm and the poor man’s data augmentation algorithms (in theory and methods). Journal of the American Statistical Association, 85, 699–704.

    Article  Google Scholar 

  • Yuan, K.H., & Bentler, P.M. (1997). Mean and covariance structure analysis: theoretical and practical improvements (in theory and methods). Journal of the American Statistical Association, 92, 767–774.

    Article  Google Scholar 

  • Yuan, K.H., & Bentler, P.M. (1998a). Robust mean and covariance structure analysis. British Journal of Mathematical and Statistical Psychology, 51, 63–88.

    Google Scholar 

  • Yuan, K.H., & Bentler, P.M. (1998b). Structural equation modelling with robust covariance. Sociological Methodology, 28, 363–396.

    Article  Google Scholar 

  • Yuan, K.H., & Bentler, P.M. (2000). Robust mean and covariance structure analysis through iteratively reweighted least squares. Psychometrika, 65, 43–58.

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Sik-Yum Lee.

Additional information

The research described herein was fully supported by a grant (CUHK 4243/03H) from the Rearch Grants Council of the Hong Kong Special Administration Region. The authors are thankful to the Editor, the Associate Editor, and anonymous reviewers for valuable comments which improve the paper significantly, and are grateful to ICPSR and the relevant funding agency for allowing the use of their data.

Requests for reprints should be sent to S. Y. Lee, Department of Statistics, The Chinese University of Hong Kong, Shatin, N. T., Hong Kong.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Lee, SY., Xia, YM. Maximum Likelihood Methods in Treating Outliers and Symmetrically Heavy-Tailed Distributions for Nonlinear Structural Equation Models with Missing Data. Psychometrika 71, 565–585 (2006). https://doi.org/10.1007/s11336-006-1264-1

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11336-006-1264-1

Keywords

Navigation