Generalized Linear Mixed Models for Categorical and Ordinal Responses

Salinas Ruíz, Josafhat; Montesinos López, Osval Antonio; Hernández Ramírez, Gabriela; Crossa Hiriart, Jose

doi:10.1007/978-3-031-32800-8_8

Abstract

According to Agresti (2013), the multinomial distribution generalizes the binomial distribution to responses with more than two possible outcomes, which may be ordered (ordinal) or unordered (nominal). Given a response with more than two possible outcomes and independent trials in which each category has the same probability on every trial, the counts across categories follow a multinomial distribution. Quinn and Keough (2002) note that several methods exist for analyzing multinomial data. In the biological sciences, the most common approach to categorical data that yield frequency counts is to build cross-tabulations (contingency tables) and use chi-squared tests to examine associations between two or more categorical variables. However, this approach is ill suited to studies that aim to estimate how the response changes with the explanatory variable(s), because contingency-table analysis assesses association without designating any variable as a predictor or a response. The chi-squared results are considered valid as long as fewer than 20% of the cells have an expected count below five and no cell has an expected count below one (Logan 2010). For small samples, Fisher’s exact test is an alternative to the chi-squared test.

8.1 Introduction

According to Agresti (2013), the multinomial distribution generalizes the binomial distribution to responses with more than two possible outcomes, which may be ordered (ordinal) or unordered (nominal). Given a response with more than two possible outcomes and independent trials in which each category has the same probability on every trial, the counts across categories follow a multinomial distribution. Quinn and Keough (2002) note that several methods exist for analyzing multinomial data. In the biological sciences, the most common approach to categorical data that yield frequency counts is to build cross-tabulations (contingency tables) and use chi-squared tests to examine associations between two or more categorical variables. However, this approach is ill suited to studies that aim to estimate how the response changes with the explanatory variable(s), because contingency-table analysis assesses association without designating any variable as a predictor or a response. The chi-squared results are considered valid as long as fewer than 20% of the cells have an expected count below five and no cell has an expected count below one (Logan 2010). For small samples, Fisher’s exact test is an alternative to the chi-squared test.
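The expected-count rule quoted from Logan (2010) can be checked mechanically before running a chi-squared test. The following is a minimal sketch in plain Python; the function names and the two example tables are hypothetical and are not taken from the chapter.

```python
def expected_counts(table):
    """Expected cell counts under independence: row total * column total / N."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    n = sum(row_totals)
    return [[r * c / n for c in col_totals] for r in row_totals]


def expected_count_rule_ok(table):
    """True if fewer than 20% of expected counts are below 5 and none is below 1."""
    cells = [e for row in expected_counts(table) for e in row]
    share_small = sum(e < 5 for e in cells) / len(cells)
    return share_small < 0.20 and min(cells) >= 1


# Hypothetical contingency tables (rows = groups, columns = categories).
large_table = [[12, 8, 10], [9, 11, 10]]
small_table = [[3, 1], [1, 2]]

large_ok = expected_count_rule_ok(large_table)
small_ok = expected_count_rule_ok(small_table)
```

With these made-up tables, the first satisfies the rule while the second does not, which is the situation in which an exact test would usually be considered instead.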

There are several methods for modeling multinomial data. Traditional methods include frequency analysis (counts), which uses the chi-squared test, and the log-linear model for contingency tables. This chapter focuses on describing multinomial logit and probit models in detail.

8.2 Concepts and Definitions

In the multinomial distribution, each of the N observations belongs to exactly one of C mutually exclusive categories c = 1, ⋯, C, and each observation has probability π_c of belonging to category c. The multinomial distribution gives the probability that exactly y₁ of the N randomly sampled observations belong to category 1, y₂ belong to category 2, and so forth up to y_C in category C, where $ \sum \limits_{c=1}^C{y}_c=N $ and $ \sum \limits_{c=1}^C{\pi}_c=1 $. The probability mass function of this distribution is

$$ f\left({y}_1,{y}_2,\dots, {y}_C\right)=\frac{N!}{y_1!{y}_2!\dots {y}_C!}{\pi}_1^{y_1}{\pi}_2^{y_2}\dots {\pi}_C^{y_C} $$
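As a small illustration of the formula above, the following sketch evaluates the multinomial probability for given counts and category probabilities using only the Python standard library. The function name and the example values are hypothetical.

```python
from math import factorial, prod


def multinomial_pmf(counts, probs):
    """Probability of observing `counts` (y_1..y_C) given probabilities `probs` (pi_1..pi_C)."""
    if len(counts) != len(probs):
        raise ValueError("counts and probs must have the same length")
    if abs(sum(probs) - 1.0) > 1e-9:
        raise ValueError("category probabilities must sum to 1")
    n = sum(counts)
    # Multinomial coefficient N! / (y_1! y_2! ... y_C!), kept as an exact integer.
    coefficient = factorial(n)
    for y in counts:
        coefficient //= factorial(y)
    return coefficient * prod(p ** y for p, y in zip(probs, counts))


# Hypothetical example: N = 10 observations over C = 3 categories.
three_category = multinomial_pmf([2, 3, 5], [0.2, 0.3, 0.5])

# With C = 2 the same function reproduces the binomial probability.
two_category = multinomial_pmf([3, 7], [0.5, 0.5])
```

The two-category call shows the sense in which the multinomial distribution generalizes the binomial one.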

Multinomial models are applied in data analysis where the categorical response variable has more than two possible outcomes while the independent variables can be continuous, categorical, or both (Hosmer and Lemeshow 2000). The categorical response variable can be either ordinal (ordered) or nominal (unordered). Ordinal response variables take values that represent a rank order on some dimension, but with too few distinct values to be treated as a continuous variable. Nominal (unordered) response variables are those whose values identify a category but carry no indication of rank or order. Models for multinomial data are constructed in a similar way as for binomial data, and the link functions used are similar to the logit and probit functions used for binomial data. Cumulative logit and cumulative probit models define the link function such that, when properly fitted to the data, they allow for parsimonious modeling of ordinal multinomial data. Generalized logit and probit models do not require ordered categories and are therefore suitable for nominal multinomial data.

In terms of generalized linear models (GLMs) and generalized linear mixed models (GLMMs), a multinomial distribution with C categories requires C − 1 link functions to fully specify a model that relates the response probabilities (π₁, π₂, …, π_C) to the linear predictor. The commonly used models are the cumulative logit model, also known as the proportional odds model proposed by McCullagh (1980), and the cumulative probit model, also known as the threshold model. Throughout this chapter, we will use either of these two link functions interchangeably.

The link functions for a cumulative logit model with C categories are

$$ {\displaystyle \begin{array}{c}{\boldsymbol{\eta}}_1=\log \left(\frac{\pi_1}{1-{\pi}_1}\right)={\eta}_1+\boldsymbol{X}\boldsymbol{\beta } +\boldsymbol{Zb}\\ {}{\boldsymbol{\eta}}_2=\log \left(\frac{\pi_1+{\pi}_2}{1-\left({\pi}_1+{\pi}_2\right)}\right)={\eta}_2+\boldsymbol{X}\boldsymbol{\beta } +\boldsymbol{Zb}\\ {}\vdots \\ {}{\boldsymbol{\eta}}_{\boldsymbol{C}-1}=\log \left(\frac{\pi_1+{\pi}_2+\cdots +{\pi}_{C-1}}{1-\left({\pi}_1+{\pi}_2+\cdots +{\pi}_{C-1}\right)}\right)={\eta}_{C-1}+\boldsymbol{X}\boldsymbol{\beta } +\boldsymbol{Zb}\end{array}} $$

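where η_c (c = 1, …, C − 1) on the right-hand side denotes the intercept (threshold) of the c-th cumulative logit, X and Z are the design matrices, and β and b are the vectors of fixed and random effects parameters, respectively. The inverse links of each of the functions are as follows: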

$$ {\displaystyle \begin{array}{c}{\pi}_1=\frac{1}{1+{e}^{-{\boldsymbol{\eta}}_1}}=h\left({\boldsymbol{\eta}}_1\right)\\ {}{\pi}_1+{\pi}_2=\frac{1}{1+{e}^{-{\boldsymbol{\eta}}_2}}=h\left({\boldsymbol{\eta}}_2\right)\\ {}\vdots \\ {}{\pi}_1+{\pi}_2+\cdots +{\pi}_{C-1}=\frac{1}{1+{e}^{-{\boldsymbol{\eta}}_{\boldsymbol{C}-1}}}=h\left({\boldsymbol{\eta}}_{\boldsymbol{C}-1}\right).\end{array}} $$

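Once h(η₁), h(η₂), ..., h(η_{C − 1}) have been estimated, we can then estimate the probabilities $ {\hat{\pi}}_1 $, $ {\hat{\pi}}_2 $, ..., $ {\hat{\pi}}_C $ by differencing successive cumulative probabilities: $ {\hat{\pi}}_1=h\left({\hat{\boldsymbol{\eta}}}_1\right) $, $ {\hat{\pi}}_c=h\left({\hat{\boldsymbol{\eta}}}_c\right)-h\left({\hat{\boldsymbol{\eta}}}_{c-1}\right) $ for c = 2, …, C − 1, and $ {\hat{\pi}}_C=1-h\left({\hat{\boldsymbol{\eta}}}_{C-1}\right) $.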
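The step from cumulative probabilities to category probabilities can be sketched as follows. This is only an illustration under assumed values: the thresholds and the value of Xβ + Zb (called `shift` here) are hypothetical numbers, not estimates from any data set in the chapter.

```python
import math


def inverse_logit(eta):
    """Inverse of the logit link: h(eta) = 1 / (1 + exp(-eta))."""
    return 1.0 / (1.0 + math.exp(-eta))


def category_probabilities(thresholds, shift):
    """Category probabilities pi_1..pi_C from a cumulative logit model.

    thresholds: increasing intercepts eta_1 < ... < eta_{C-1}
    shift: value of X*beta + Z*b for one observation (common to all cutoffs)
    """
    # Cumulative probabilities P(Y <= c) for c = 1..C-1, then P(Y <= C) = 1.
    cumulative = [inverse_logit(t + shift) for t in thresholds] + [1.0]
    probabilities = []
    previous = 0.0
    for value in cumulative:
        probabilities.append(value - previous)  # pi_c = P(Y <= c) - P(Y <= c-1)
        previous = value
    return probabilities


# Hypothetical thresholds for C = 3 categories and a hypothetical linear predictor value.
probs = category_probabilities([-1.0, 0.5], shift=0.3)
```

Because the thresholds are increasing, every difference is positive and the C probabilities sum to one.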

8.3 Cumulative Logit Models (Proportional Odds Models)

Multinomial logit models are used to model the relationships between a polytomous response variable and a set of predictor variables. These polytomous response models can be classified – as mentioned above – into two different types, depending on whether the response variable has an ordered or an unordered structure.

In a proportional odds model, the covariates (through the linear predictor η) have the same effect on every cumulative logit, regardless of the category cutoff. Changing the values of the covariates therefore shifts the response distribution to the right (or left) without changing its shape. The cumulative logits model the effect of the covariates on the probability that the response falls at or below each category cutoff.
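The "same effect at every cutoff" property can be checked numerically. In the sketch below, the thresholds and the two linear predictor values are hypothetical; the point is only that the ratio of cumulative odds between two covariate settings is identical at each cutoff.

```python
import math


def cumulative_odds(threshold, shift):
    """Odds of P(Y <= c) under a cumulative logit model with a common shift."""
    p = 1.0 / (1.0 + math.exp(-(threshold + shift)))
    return p / (1.0 - p)


# Hypothetical thresholds (C = 4 categories) and two hypothetical covariate settings.
thresholds = [-1.0, 0.5, 2.0]
shift_a, shift_b = 0.0, 0.8

odds_ratios = [
    cumulative_odds(t, shift_b) / cumulative_odds(t, shift_a)
    for t in thresholds
]
# Under proportional odds, each ratio equals exp(shift_b - shift_a).
common_ratio = math.exp(shift_b - shift_a)
```

If the ratios differed across cutoffs in real data, that would be evidence against the proportional odds assumption.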

A multinomial logit model assumes independence of irrelevant alternatives, which implies that the odds of choosing category c rather than category c′ (c ≠ c′) do not depend on the characteristics of any other category. Under this assumption, if a new category becomes available, the probabilities of the existing categories adjust so that the original odds between all pairs of outcomes are preserved. The proportional odds model, in turn, makes the strict assumption that the odds ratio does not depend on the category cutoff; this proportional odds assumption, also called the “parallel regression assumption,” therefore needs to be tested.
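The preservation of pairwise odds can be illustrated with a small generalized-logit (softmax) calculation. The category labels and linear predictor values below are hypothetical and serve only to show the property.

```python
import math


def generalized_logit_probabilities(linear_predictors):
    """Category probabilities proportional to exp(linear predictor)."""
    largest = max(linear_predictors.values())
    # Subtracting the maximum keeps the exponentials numerically stable.
    weights = {k: math.exp(v - largest) for k, v in linear_predictors.items()}
    total = sum(weights.values())
    return {k: w / total for k, w in weights.items()}


# Hypothetical linear predictors for two categories, then with a third added.
before = generalized_logit_probabilities({"A": 0.4, "B": -0.2})
after = generalized_logit_probabilities({"A": 0.4, "B": -0.2, "C": 1.0})

odds_before = before["A"] / before["B"]
odds_after = after["A"] / after["B"]
```

The individual probabilities of A and B both shrink once C is available, but their ratio stays at exp(0.4 − (−0.2)).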

8.3.1 Completely Randomized Design (CRD) with a Multinomial Response: Ordinal

Data are obtained from an experiment related to red core disease in strawberries, which is caused by the fungus Phytophthora fragariae. In this example, 12 strawberry populations were evaluated in a completely randomized experiment with 4 replications (Table 8.1). Plots generally consisted of 10 plants; in some cases, only 9 plants were observed. At the end of the experiment, each plant was assigned to one of three ordered categories representing fungal damage (1 = no damage, 2 = moderate damage, and 3 = severe damage).

Table 8.1 Evaluation of red core disease in strawberry plants

Data: CRD with a multinomial response: ordinal
Rep	Trt	Cat	Freq	Rep	Trt	Cat	Freq
rep1	M1H1	Without	0	rep1	M2H3	Moderate	3
rep2	M1H1	Without	2	rep2	M2H3	Moderate	1
rep3	M1H1	Without	2	rep3	M2H3	Moderate	3
rep4	M1H1	Without	2	rep4	M2H3	Moderate	2
rep1	M1H2	Without	2	rep1	M2H4	Moderate	4
rep2	M1H2	Without	0	rep2	M2H4	Moderate	2
rep3	M1H2	Without	4	rep3	M2H4	Moderate	2
rep4	M1H2	Without	2	rep4	M2H4	Moderate	5
rep1	M1H3	Without	3	rep1	M2H1	Severe	4
rep2	M1H3	Without	7	rep2	M2H1	Severe	6
rep3	M1H3	Without	1	rep3	M2H1	Severe	7
rep4	M1H3	Without	2	rep4	M2H1	Severe	4
rep1	M1H4	Without	0	rep1	M2H2	Severe	5
rep2	M1H4	Without	5	rep2	M2H2	Severe	2
rep3	M1H4	Without	2	rep3	M2H2	Severe	3
rep4	M1H4	Without	1	rep4	M2H2	Severe	4
rep1	M1H1	Moderate	3	rep1	M2H3	Severe	3
rep2	M1H1	Moderate	2	rep2	M2H3	Severe	4
rep3	M1H1	Moderate	3	rep3	M2H3	Severe	4
rep4	M1H1	Moderate	5	rep4	M2H3	Severe	4
rep1	M1H2	Moderate	3	rep1	M2H4	Severe	5
rep2	M1H2	Moderate	3	rep2	M2H4	Severe	6
rep3	M1H2	Moderate	6	rep3	M2H4	Severe	0
rep4	M1H2	Moderate	3	rep4	M2H4	Severe	3
rep1	M1H3	Moderate	4	rep1	M3H1	Without	0
rep2	M1H3	Moderate	2	rep2	M3H1	Without	3
rep3	M1H3	Moderate	1	rep3	M3H1	Without	2
rep4	M1H3	Moderate	3	rep4	M3H1	Without	0
rep1	M1H4	Moderate	5	rep1	M3H2	Without	5
rep2	M1H4	Moderate	4	rep2	M3H2	Without	3
rep3	M1H4	Moderate	8	rep3	M3H2	Without	3
rep4	M1H4	Moderate	4	rep4	M3H2	Without	2
rep1	M1H1	Severe	6	rep1	M3H3	Without	0
rep2	M1H1	Severe	6	rep2	M3H3	Without	2
rep3	M1H1	Severe	5	rep3	M3H3	Without	1
rep4	M1H1	Severe	3	rep4	M3H3	Without	0
rep1	M1H2	Severe	5	rep1	M3H4	Without	3
rep2	M1H2	Severe	7	rep2	M3H4	Without	5
rep3	M1H2	Severe	0	rep3	M3H4	Without	7
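Before fitting any model, the frequencies can be summarized as empirical cumulative proportions and logits. The sketch below does this for treatment M1H1 only, using the twelve M1H1 frequencies listed in Table 8.1; it is a descriptive calculation and not the model fit, and the variable names are hypothetical.

```python
import math

# Frequencies for treatment M1H1 copied from Table 8.1 (rep1 to rep4).
m1h1 = {
    "Without": [0, 2, 2, 2],
    "Moderate": [3, 2, 3, 5],
    "Severe": [6, 6, 5, 3],
}

# Total plants per ordered category, pooled over replications.
totals = [sum(m1h1[cat]) for cat in ("Without", "Moderate", "Severe")]
n_plants = sum(totals)

# Empirical cumulative proportions P(Y <= c) for the first C - 1 categories.
cumulative_proportions = []
running = 0
for count in totals[:-1]:
    running += count
    cumulative_proportions.append(running / n_plants)

# Empirical cumulative logits log(P / (1 - P)).
empirical_logits = [math.log(p / (1.0 - p)) for p in cumulative_proportions]
```

The pooled total of 39 plants is consistent with the description of plots of 10 plants, with one plot of 9.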
