Robust Estimation of General Linear Mixed Effects Models

Koller, Manuel; Stahel, Werner A.

doi:10.1007/978-3-031-22687-8_14

Manuel Koller^3,4 &
Werner A. Stahel⁵

561 Accesses

Abstract

The classical REML estimator for fitting a general linear mixed effects model is modified by bounding the terms appearing in the scoring equations. This leads to a generally applicable robust M-type estimator that we call robust scoring equations estimator. It requires only minor assumptions on the covariance matrices (block diagonal for the random effects and diagonal, known up to scale for the residual errors) additional to those of the classical methods. The structure of the data is arbitrary as long as the model is estimable in the classical sense. The estimator can detect and contain the effect of outliers in moderately contaminated datasets. Contamination is detected and treated at all levels of variability of the model, e.g., at both the subject and the observation level for a one-way ANOVA model. The estimator’s properties are studied by simulation and two examples. One example implies crossed random effects, for which the known robust methods are not applicable.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 149.00; Price excludes VAT (USA)

Softcover Book: USD 199.99; Price excludes VAT (USA)

Hardcover Book: USD 199.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Agostinelli, C., & Yohai, V. J. (2016). Composite robust estimators for linear mixed models. Journal of the American Statistical Association, 111(516), 1764–1774.
Article MathSciNet Google Scholar
Agostinelli, C., & Yohai, V. J. (2019). robustvarComp: Robust estimation for variance component models. R package version 0.1-5.
Google Scholar
Alqallaf, F., Van Aelst, S., Yohai, V. J., & Zamar, R. H. (2009). Propagation of outliers in multivariate data. The Annals of Statistics, 311–331.
Google Scholar
Bates, D., Mächler, M., Bolker, B., & Walker, S. (2015). Fitting linear mixed-effects models using lme4. Journal of Statistical Software, 67(1), 1–48.
Article Google Scholar
Bates, D. M. (2010). lme4: Mixed-Effects Modeling with R. http://lme4.r-forge.r-project.org/book/.
Belenky, G., Wesensten, N. J., Thorne, D. R., Thomas, M. L., Sing, H. C., Redmond, D. P., Russo, M. B., & Balkin, T. J. (2003). Patterns of performance degradation and restoration during sleep restriction and subsequent recovery: A sleep dose-response study. Journal of Sleep Research, 12, 1–12.
Article Google Scholar
Chervoneva, I., & Vishnyakov, M. (2011). Constrained S-estimators for linear mixed effects models with covariance components. Statistics in Medicine, 30(14), 1735–1750.
Article MathSciNet Google Scholar
Chervoneva, I., & Vishnyakov, M. (2014). Generalized S-estimators for linear mixed effects models. Statistica Sinica, 1257–1276.
Google Scholar
Copt, S., & Victoria-Feser, M. (2006). High-breakdown inference for mixed linear models. Journal of the American Statistical Association, 101(473), 292–300.
Article MathSciNet MATH Google Scholar
Davies, O. L., & Goldsmith, P. L., (Eds.) (1972). Statistical methods in research and production (4th edn.) Hafner.
Google Scholar
Fellner, W. (1986). Robust estimation of variance components. Technometrics, 28(1), 51–60.
Article MathSciNet MATH Google Scholar
Fernández, C., & Steel, M. F. (1998). On Bayesian modeling of fat tails and skewness. Journal of the American Statistical Association, 93(441), 359–371.
MathSciNet MATH Google Scholar
Geraci, M., & Bottai, M. (2014). Linear quantile mixed models. Statistics and Computing, 24(3), 461–479.
Article MathSciNet MATH Google Scholar
Heritier, S., Cantoni, E., Copt, S., & Victoria-Feser, M. (2009). Robust methods in biostatistics. John Wiley & Sons.
Book MATH Google Scholar
King, R., & Anderson, E. (2021). skewt: The skewed Student-t distribution. R package version 1.0.
Google Scholar
Koller, M. (2013). Robust estimation of linear mixed models. Diss., ETH Zürich, Nr. 20997, 2013.
Google Scholar
Koller, M. (2016). robustlmm: an R package for robust estimation of linear mixed-effects models. Journal of Statistical Software, 75, 1–24.
Article Google Scholar
Koller, M. (2022). Replication code for simulation studies. https://CRAN.R-Project.org/package=robustlmm. Vignette included in R package robustlmm, version 3.0.
Koller, M., & Stahel, W. A. (2011). Sharpening Wald-type inference in robust regression for small samples. Computational Statistics & Data Analysis, 55(8), 2504–2515.
Article MathSciNet MATH Google Scholar
Maronna, R. A., Martin, R. D., Yohai, V. J., & Salibián-Barrera, M. (2019). Robust statistics: Theory and methods (with R). John Wiley & Sons.
MATH Google Scholar
Mason, F., Cantoni, E., & Ghisletta, P. (2021). Parametric and semi-parametric bootstrap-based confidence intervals for robust linear mixed models. Methodology, 17(4), 271–295.
Article Google Scholar
Miller, J. (1977). Asymptotic properties of maximum likelihood estimates in the mixed model of the analysis of variance. The Annals of Statistics, 5(4), 746–762.
Article MathSciNet MATH Google Scholar
Papritz, A., Künsch, H. R., Schwierz, C., & Stahel, W. A. (2013). Robust geostatistical analysis of spatial data. In EGU General Assembly Conference Abstracts (p. 14145).
Google Scholar
Pinheiro, J., Liu, C., & Wu, Y. (2001). Efficient algorithms for robust estimation in linear mixed-effects models using the multivariate t distribution. Journal of Computational and Graphical Statistics, 10(2), 249–276.
Article MathSciNet Google Scholar
R Core Team (2014). R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing.
Google Scholar
Richardson, A. (1997). Bounded influence estimation in the mixed linear model. Journal of the American Statistical Association, 92(437).
Google Scholar
Searle, S. R., Casella, G., & McCulloch, C. E. (1992). Variance components. John Wiley & Sons.
Book MATH Google Scholar
Stahel, W. (1987). Estimation of a covariance matrix with location: Asymptotic formulas and optimal B-robust estimators. Journal of Multivariate Analysis, 22(2), 296–312.
Article MathSciNet MATH Google Scholar
Stahel, W., & Welsh, A. (1997). Approaches to robust estimation in the simplest variance components model. Journal of Statistical Planning and Inference, 57(2), 295–319.
Article MathSciNet MATH Google Scholar

Download references

Acknowledgements

The authors would like to thank Kali Tal for providing editorial help with an earlier version of the manuscript.

Author information

Authors and Affiliations

Institute of Social and Preventive Medicine, University of Bern, Bern, Switzerland
Manuel Koller
Seminar für Statistik, ETH Zürich, Zürich, Switzerland
Manuel Koller
Seminar für Statistik, ETH Zürich, Zürich, Switzerland
Werner A. Stahel

Authors

Manuel Koller
View author publications
You can also search for this author in PubMed Google Scholar
Werner A. Stahel
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Manuel Koller .

Editor information

Editors and Affiliations

School of Statistics, Beijing Normal University, Beijing, China
Mengxi Yi
Department of Mathematics and Statistics, University of Jyväskylä, Jyväskylä, Finland
Klaus Nordhausen

Appendix

1.1 Linear Approximation of Estimated Quantities

In this section, we develop linear approximations to the residuals and the estimated random effects . We use these linear approximations to compute the expected values in the estimating equations as well as the scaling factors τ used in the DAS approach.

Let , , $\mathbf \psi _e^* = {} \mathbf \psi _e\kern -1pt\left ({\mathbf \varepsilon ^{*}}/{\sigma }\right )\kern +0.5pt/\lambda _e$, $\mathbf \psi _b^* = \boldsymbol {\Lambda }_b^{-1}\mathbf \psi _b\kern -1pt\left (\mathbf b^{*}/\sigma \right )\kern +0.5pt$, $\boldsymbol {D}_e =\mathbf {Diag}\kern -1pt\left (\mathbf \psi ^{\prime }_e\kern -1pt\left ({\mathbf \varepsilon ^{*}}/{\sigma }\right )\kern +0.5pt\right )\kern +0.5pt/\lambda _e$, with , , and .

We linearize around β and b^∗, which will be the “true” β and B^∗ later on,

Plugging these expressions into the estimating equations (7), divided by λ_e, and (11) and combining both equations into one yields

where

Using the formula for the inversion of a partitioned matrix, we have

(14)

where

or, equivalently,

Plugging this into (14), we get an approximation for the residuals and for the estimated random effects,

(15)

(16)

with

1.2 Covariance Matrices

The approximations (15) and (16) are used in the computation of covariance matrices. In simpler setups, covariance matrices are calculated on the basis of influence functions IF by integrating . IF is obtained, in the same way as for any M-estimator, from a linear approximation and results proportional to the ψ-function, the factor being the integral of its derivative, $\lambda = {\mathbf {E}}_{0}\left [\psi ^{\prime }_e\right ]$. Even though we have no rigorous proof for a generalization to our case, we apply this idea here.

The expected values of D_e and D_b are the identity matrices. When these expected values are used as approximations, the matrices M_.. and A_.. depend only on θ. The calculation of covariance matrices is then straightforward. They will contain the following expectations under the standard normal distribution:

$$\displaystyle \begin{aligned} \gamma_{.}^{(1)} = {\mathbf E}_o\left[z\psi_e\kern-1pt\left(z\right)\kern+0.5pt\right]/\lambda_. \qquad \gamma_{.}^{(2)} = {\mathbf E}_o\left[\psi_e\kern-1pt\left(z\right)\kern+0.5pt^2\right]/\lambda_.^2 \end{aligned}$$

where the dot (_.) stands for e or b. The corresponding expressions for the block-diagonal case are and . These are diagonal matrices with entries $\gamma _{b}^{(1)}\kern -1pt\left (k\kern -1pt\left (j\right )\kern +0.5pt,m_{k\kern -1pt\left (j\right )\kern +0.5pt}\right )\kern +0.5pt$ and $\gamma _{b}^{(2)}\kern -1pt\left (k\kern -1pt\left (j\right )\kern +0.5pt,m_{k\kern -1pt\left (j\right )\kern +0.5pt}\right )\kern +0.5pt$, respectively, which depend on the dimensions of the blocks k. They are given by

$$\displaystyle \begin{aligned} \gamma_{b}^{(p)}\kern-1pt\left(k,m\right)\kern+0.5pt = m^{-1}{\mathbf E}_{0,m}\left[w_k\kern-1pt\left(u\right)\kern+0.5pt^pu\right]\big/ \lambda_b\kern-1pt\left(k,m\right)\kern+0.5pt \qquad p=1,2 \;. \end{aligned}$$

For fully diagonal V_b, m = 1 and these formulas reduce to $\gamma _{b}^{(1)}$ and $\gamma _{b}^{(2)}$.

The covariance matrix of the estimated fixed effects is

(17)

For the derivation of the last equality, we have used the following two identities:

$$\displaystyle \begin{aligned} \begin{array}{rcl} & &\displaystyle \begin{aligned} &\displaystyle {}\boldsymbol{M}_{\beta\beta} \boldsymbol{M}_{XX}\boldsymbol{M}_{\beta\beta} + \boldsymbol{M}_{\beta\beta} \boldsymbol{M}_{XZ} \boldsymbol{M}_{b\beta} = \left(\boldsymbol{M}_{\beta\beta} \boldsymbol{M}_{XX} + \boldsymbol{M}_{\beta b} \boldsymbol{M}_{ZX} \right)\boldsymbol{M}_{\beta\beta} = {} \\ & \left(\boldsymbol{I} + \boldsymbol{M}_{XX}^{-1} \boldsymbol{M}_{XZ} \boldsymbol{M}_{bb} \boldsymbol{M}_{ZX} - \boldsymbol{M}_{XX}^{-1} \boldsymbol{M}_{XZ} \boldsymbol{M}_{bb} \boldsymbol{M}_{ZX} \right)\boldsymbol{M}_{\beta\beta} = \boldsymbol{M}_{\beta\beta} \;,\quad \text{and} \end{aligned} \\[5pt] & &\displaystyle \begin{aligned} &\displaystyle \boldsymbol{M}_{\beta b} \boldsymbol{M}_{ZX} \boldsymbol{M}_{\beta\beta} + \boldsymbol{M}_{\beta b} \boldsymbol{M}_{ZZ} \boldsymbol{M}_{b\beta} = \\ & \boldsymbol{M}_{\beta b} \left(\boldsymbol{M}_{ZX} \boldsymbol{M}_{\beta\beta} - \left(\widehat{\boldsymbol{M}}_{ZZ} - \boldsymbol{D}_b\right) \widehat{\boldsymbol{M}}_{ZZ}^{-1}\boldsymbol{M}_{ZX}\boldsymbol{M}_{\beta\beta} \right) = - \boldsymbol{M}_{\beta b} \boldsymbol{D}_b \boldsymbol{M}_{b\beta}. \end{aligned} \end{array} \end{aligned} $$

For the DAS standardization, we need the covariance matrix of the residuals $\widehat {\varepsilon }^{*}_i$ and the $\widehat {b}^{*}_j$,

1.3 Refined Design Adaptive Scale

We first write down the equation determining τ_e,i for the determination of $\widehat \sigma $ through (9). The requirement that the ith term in the sum should be zero in expectation translates to the implicit equation

$$\displaystyle \begin{aligned} \int \psi^{(\sigma)}\kern-1pt\left((e-\psi_e\kern-1pt\left(e\right)\kern+0.5pt - r)/\tau_{e,i}\right)\kern+0.5pt \varphi\kern-1pt\left(r/\sigma^{(R)}_i\right)\kern+0.5pt/\sigma^{(R)}_i \mbox dr \varphi\kern-1pt\left(e\right)\kern+0.5pt \mbox de &\\ = \kappa_\sigma\int w^{(\sigma)}\kern-1pt\left((e-\psi_e\kern-1pt\left(e\right)\kern+0.5pt - r)/\tau_{e,i}\right)\kern+0.5pt \varphi\kern-1pt\left(r/\sigma^{(R)}_i\right)\kern+0.5pt/\sigma^{(R)}_i \mbox dr \varphi\kern-1pt\left(e\right)\kern+0.5pt \mbox de & \; \end{aligned}$$

for τ_i, where $\psi ^{(\sigma )}\kern -1pt\left (e\right )\kern +0.5pt=e^2w^{(\sigma )}\kern -1pt\left (e\right )\kern +0.5pt$, φ is the standard normal density, $\sigma ^{(R)}_i$ is the standard deviation of R_i, and κ_σ is defined below (9). The modification for the variance components θ_ℓ in the case of diagonal V_b is straightforward.

For random effects with block-diagonal covariance structure, we have

and T_k is determined by

where $\psi ^{(\eta )}\kern -1pt\left (\mathbf b\right )\kern +0.5pt=\mathbf b\; w^{(\eta )}\kern -1pt\left (\|\mathbf b\|\right )\kern +0.5pt^{1/2}$ and $\boldsymbol {V}^{(R)}_k$ is the covariance matrix of R_k. (Note that the normalizing constants of the densities cancel.) Integration thus extends over $2m\kern -1pt\left (k\right )\kern +0.5pt$ dimensions. With this choice of T_k, each term in the sum (12) has approximate expectation zero. To see this, note that . Therefore, multiplying the last equation by from the right and forming the trace proves the result.

The last equation resembles the problem of estimating a robust covariance matrix and can be computed along the same lines.

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Koller, M., Stahel, W.A. (2023). Robust Estimation of General Linear Mixed Effects Models. In: Yi, M., Nordhausen, K. (eds) Robust and Multivariate Statistical Methods. Springer, Cham. https://doi.org/10.1007/978-3-031-22687-8_14

Download citation

DOI: https://doi.org/10.1007/978-3-031-22687-8_14
Published: 26 November 2022
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-22686-1
Online ISBN: 978-3-031-22687-8
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)

Publish with us

Policies and ethics

Robust Estimation of General Linear Mixed Effects Models

Abstract

Access this chapter

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Appendix

Appendix

1.1 Linear Approximation of Estimated Quantities

1.2 Covariance Matrices

1.3 Refined Design Adaptive Scale

Rights and permissions

Copyright information

About this chapter

Cite this chapter

Download citation

Share this chapter

Publish with us

Search

Navigation