Skip to main content
Log in

Analyzing sequential categorical data: Individual variation in markov chains

  • Published:
Psychometrika Aims and scope Submit manuscript

Abstract

Markov chains are probabilistic models for sequences of categorical events, with applications throughout scientific psychology. This paper provides a method for anlayzing data consisting of event sequences and covariate observations. It is assumed that each sequence is a Markov process characterized by a distinct transition probability matrix. The objective is to use the covariate data to explain differences between individuals in the transition probability matrices characterizing their sequential data. The elements of the transition probability matrices are written as functions of a vector of latent variables, with variation in the latent variables explained through a multivariate regression on the covariates. The regression is estimated using the EM algorithm, and requires the numerical calculation of a multivariate integral. An example using simulated cognitive developmental data is presented, which shows that the estimation of individual variation in the parameters of a probability model may have substantial theoretical importance, even when individual differences are not the focus of the investigator's concerns.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  • Anderson, T., & Goodman, L. (1957). Statistical inference about Markov chains.Annals of Mathematical Statistics, 28, 89–110.

    Google Scholar 

  • Atkinson, A. C. (1985).Plots, transformations, and regression: An introduction to graphical methods of diagnositc regression analysis. New York: Oxford University Press.

    Google Scholar 

  • Bakeman, R., & Gottman, J. (1986).Observing Interaction. Cambridge: Cambridge University Press.

    Google Scholar 

  • Bentler, P., & Lee, S. (1975). Some extensions of matrix calculus.General Systems, 20, 145–150.

    Google Scholar 

  • Bertler, P., & Tanaka, J. (1983). Problems with the EM algorithm for ML factor analysis.Psychometrika, 48, 247–252.

    Google Scholar 

  • Brainerd, C., Howe, M., & Desrochers, A. (1982). The general theory of two-stage learning: A mathematical review with illustrations from memory development.Psychological Bulletin, 91, 634–665.

    Google Scholar 

  • Budescu, D. (1987). A Markov model for generation of random binary sequences.Journal of Experimental Psychology: Human Perception and Performance, 13, 25–39.

    Google Scholar 

  • Chi, M., Feltovich, P., & Glaser, R. (1981). Categorization and representation of physics problems by novices.Cognitive Science, 5, 121–152.

    Google Scholar 

  • Çinlar, E. (1975).Introduction to stochastic processes. Englewood Cliffs, NJ: Prentice Hall.

    Google Scholar 

  • Dempster, A., Laird, N., & Rubin, D. (1977). Maximum likelihood from incomplete data via the EM algorithm with discussion.Journal of the Royal Statistical Society, Series B,39, 1–38.

    Google Scholar 

  • Edlefsen, E., & Jones, S. (1986).GAUSS programming language manual. Kent, WA: Aptech Systems.

    Google Scholar 

  • Gardner, W. (1989).Analysis of individual variation in Markov chains (Tech. Rep. No. 87-8). Charlottesville: University of Virginia, Department of Psychology.

    Google Scholar 

  • Gardner, W., & Hartmann, D. (1984). Markov analysis of social interaction.Behavioral Assessment, 6, 229–236.

    Google Scholar 

  • Gottman, J. (1979). Detecting cyclicity in social interaction.Psychological Bulletin, 36, 338–348.

    Google Scholar 

  • Gottman, J., & Roy, A. (1989).Sequential analysis: A guide for behavioral researchers. New York: Cambridge University Press.

    Google Scholar 

  • Greeno, J. (1974). Representation of learning as a discrete transition in a finite state space. In D. Krantz, R. Atkinson, R. Luce, & P. Suppes (Eds.),Contemporary developments in mathematical psychology: Vol. 1: Learning, memory, and thinking. San Francisco: W. H. Freeman.

    Google Scholar 

  • Hanushek, E., & Jackson, J. (1977).Statistical methods for social scientists. New York: Acadmeic Press.

    Google Scholar 

  • Heckman, J., & Singer, B. (1982). Population heterogeneity in demographic models. In K. Land & A. Rogers (Eds.),Multidimensional mathematical demography. New York: Academic Press.

    Google Scholar 

  • Louis, T. (1982). Finding the observed information matrix when using the EM algorithm.Journal of the Royal Statistical Society, Series B,44, 226–233.

    Google Scholar 

  • Schotenberg, R. (1985). Latent variables in the analysis of limited dependent variables. In N. Tuma (Ed.),Sociological methodology 1985. San Francisco: Jossey-Bass.

    Google Scholar 

  • Townsend, J., & Ashby, G. (1983).The stochastic modeling of elementary psychological processes. New York: Cambridge University Press.

    Google Scholar 

  • White, H. (1982). Maximum likelihood estimation of misspecified models.econometrica, 50, 1–25.

    Google Scholar 

  • Wilkinson, A. (1982). Partial knowledge and self correction: Developmental studies of a quantitative concept.Development Psychology, 18, 876–893.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Additional information

Research contributing to this article was supported by B.R.S. Subgrant 5-35345 from the University of Virginia. I thank the DADA Group, Bill Fabricius, Don Hartmann, William Griffin, Jack McArdle, Ivo Molenaar, Ronald Schoenberg, Simon Tavaré, and several anonymous reviewers for their discussion of these points.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Gradner, W. Analyzing sequential categorical data: Individual variation in markov chains. Psychometrika 55, 263–275 (1990). https://doi.org/10.1007/BF02295287

Download citation

  • Received:

  • Revised:

  • Issue Date:

  • DOI: https://doi.org/10.1007/BF02295287

Key words

Navigation