Graphical Models with R pp 27-49 | Cite as
Log-Linear Models
Abstract
This chapter describes graphical models for multivariate discrete (categorical) data. It starts out by describing various different ways in which such data may be represented in R—for example, as contingency tables—and how to convert between these representations. It then gives a concise exposition of the theory of hierarchical log-linear models, with illustrative examples using the gRim package. Topics covered include log-linear model formulae, dependence graphs, graphical and decomposable models, maximum likelihood estimation using the IPS algorithm, and hypothesis testing. Model selection is briefly discussed and illustrated using a stepwise algorithm. Graphical modeling is particularly useful for multi-dimensional tables, and since these are often sparse, it is necessary to adjust the degrees of freedom as normally calculated. Other topics treated in the chapter include exact conditional tests, ordinal categorical variables, and a comparison of fitting log-linear models using IPS and the glm algorithms. A final section describes some utilities for working with the models.
Keywords
Dependence Graph Conditional Independence Saturated Model Markov Chain Monte Carlo Method Main Effect ModelReferences
- Akaike H (1974) A new look at the statistical identification problem. IEEE Trans Autom Control 19:716–723 MathSciNetMATHCrossRefGoogle Scholar
- Badsberg JH (1991) A guide to CoCo. Tech rep. R-91-43. Department of Mathematics and Computer Science, Aalborg University Google Scholar
- Bishop YMM, Fienberg SE, Holland PW (1975) Discrete multivariate analysis: theory and practice. MIT Press, Cambridge MATHGoogle Scholar
- Edwards D (2000) Introduction to graphical modelling, 2nd edn. Springer, New York MATHCrossRefGoogle Scholar
- Grizzle JE, Starmer CF, Koch GG (1969) Analysis of categorical data by linear models. Biometrics 25(3):489–504 MathSciNetMATHCrossRefGoogle Scholar
- Reiniš Z, Pokorný J, Bazika V, Tišerová J, Goričan K, Horáková D, Stuchlíková E, Havránek T, Hrabovský F (1981) Prognostický význam rizikového profilu v prevenci ischemické choroby srdce. Bratisl Lek Listy 76:137–150 Google Scholar
- Ripley BD (1996) Pattern recognition and neural networks. Cambridge University Press, Cambridge MATHGoogle Scholar
- Schoener TW (1968) The Anolis lizards of Bimini: resource partitioning in a complex fauna. Ecology 49:704–726 CrossRefGoogle Scholar
- Schwarz G (1978) Estimating the dimension of a model. Ann Math Stat 6:461–464 MATHGoogle Scholar
- Whittaker J (1990) Graphical models in applied multivariate statistics. Wiley, Chichester MATHGoogle Scholar