Skip to main content

A Method for Analyzing Longitudinal Outcomes with Many Zeros

Abstract

Health care utilization and cost data have challenged analysts because they are often correlated over time, highly skewed, and clumped at 0. Traditional approaches do not address all these problems, and evaluators of mental health and substance abuse interventions often grapple with the problem of how to analyze these data in a way that accurately represents program impact. Recently, the traditional 2-part model has been extended to mixed-effects mixed-distribution model with correlated random effects to deal simultaneously with excess zeros, skewness, and correlated observations. We introduce and demonstrate this new method to mental health services researchers and evaluators by analyzing the data from a study of assertive community treatment (ACT). The response variable is the number of days of hospitalization, collected every 6 months over 3 years. The explanatory variable is group: ACT vs. standard case management. Diagnosis (schizophrenia vs. bipolar disorder), time, and the baseline values of hospital days are covariates. Results indicate that clients in the ACT group have a higher probability of hospital admission, but tend to have shorter lengths of stay. The mixed-distribution model provides greater specification of a model to fit these data and leads to more refined interpretation of the results.

This is a preview of subscription content, access via your institution.

REFERENCES

  • Drake, R., McHugo, G., Clark, R., Teague, G., Xie, H., & Miles, K. (1998). Assertive community treatment for patients with co-occurring severe mental illness and substance abuse disorder: A clinical trial. American Journal of Orthopsychiatry, 68(2), 201–215.

    Google Scholar 

  • Duan, N., Manning, W. G., Morris, C. N., & Newhouse, J. P. (1983). A comparison of alternative models for the demand for medical care. Journal of Economic and Business Statistics, 1, 115–126.

    Google Scholar 

  • Green, W. (1994). Accounting for excess zeros and sample selection in Poisson and negative binomial regression models.(Working paper EC-94-10). New York: New York University, Department of Economics.

    Google Scholar 

  • Grunwald, G. K., & Jones, R. H. (2000). Markov models for time series with mixed distribution. Environmetrics, 11, 7–339.

    Google Scholar 

  • Hajivassiliou, V. A. (1994). Asimulation estimation analysis of the external debt crises of developing countries. Journal of Applied Econometrics, 9,1–131.

    Google Scholar 

  • Hall, D. B. (2000). Zero-inflated Poisson and binomial regression with random effects: A case study. Biometrics, 56, 1030–1039.

    Google Scholar 

  • Heckman, J. (1974). Shadow prices, market wages, and labor supply. Econometrica, 42, 674–679.

    Google Scholar 

  • Heckman, J. (1976). The common structure of statistical models of truncation, sample selection, and limited dependent varibales, and a sample estimator for such models. The Annals of Economic and Social Measurement, 5, 475–592.

    Google Scholar 

  • Hur, K. (1999). A random-effects zero-inflated Poisson regression model for clustered extra-zero counts. Unpublished PhD Dessertation, University of Illinois at Chicago.

  • Lachenbruch, P. (1976). Analysis of data with clumping at zero. Biometrische Zeitschrift, 18, 351–356.

    Google Scholar 

  • Lachenbruch, P. (1992). Utility of regression analysis in epidemiologic studies of the elderly. In W. R. Wallace (Ed.), The epidemiologic study of the elderly (pp. 371–381). Oxford, UK: Oxford University Press.

    Google Scholar 

  • Laird, N. M., & Ware, H. (1982). Random-effect models for longitudinal data. Biometrics, 38, 963–974.

    Google Scholar 

  • Lambert, D. (1992). Zero-inflated Poisson regression, with an application to detects in manufacturing. Technometrics, 34, 1–14.

    Google Scholar 

  • Manning, W. G., Duan, N., & Rogers, W. H. (1987). Monte Carlo evidence on the choice between sample selection and two-part models. Journal of Econometrics, 35, 59–82.

    Google Scholar 

  • Muthen, L. K., & Muthen, B. O. (2004). Mplus User's Guide (Version Three). Los Angeles, CA: Muthen & Muthen.

    Google Scholar 

  • Olsen, M., & Schafer, J. (2001). A two-part random-effects model for semicontinuous longitudinal data. Journal of American Statistical Association, 96, 730–745.

    Google Scholar 

  • Tobin, J. (1958). Estimation of relationships for limited dependent variables. Econometrica, 26, 24–36.

    Google Scholar 

  • Tooze, A. J., Grunwald, G. K., & Jones, R. H. (2002). Analysis of repeated measures data with clumping at zero. Statistical Methods in Medical Research, 11, 341–355.

    Google Scholar 

  • Xie, H., Hur, K., & McHugo, G. (2000). Using random-effects zero-inflated Poisson model to analyze longitudinal count data with extra zeros. Paper presented at the 53rd Session of the Inter-national Statistical Institute, Seoul, South Korea.

  • Yau, K., & Lee, A. (2001). Zero-inflated Poisson regression with random effects to evaluate an occupational injury prevention program. Statistics in Medicine, 20, 2907–2920.

    Google Scholar 

  • Zeger, S., Liang, K., & Albert, P. (1988). Models for longitudinal data: A generalized estimating equation approach. Biometrics, 44, 1049–1060.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Rights and permissions

Reprints and Permissions

About this article

Cite this article

Xie, H., McHugo, G., Sengupta, A. et al. A Method for Analyzing Longitudinal Outcomes with Many Zeros. Ment Health Serv Res 6, 239–246 (2004). https://doi.org/10.1023/B:MHSR.0000044749.39484.1b

Download citation

  • Issue Date:

  • DOI: https://doi.org/10.1023/B:MHSR.0000044749.39484.1b

  • mixed distribution
  • excess zeros
  • two-part model
  • repeated measures
  • assertive community treatment