Skip to main content

Robust Estimation of General Linear Mixed Effects Models

  • Chapter
  • First Online:
Robust and Multivariate Statistical Methods

Abstract

The classical REML estimator for fitting a general linear mixed effects model is modified by bounding the terms appearing in the scoring equations. This leads to a generally applicable robust M-type estimator that we call robust scoring equations estimator. It requires only minor assumptions on the covariance matrices (block diagonal for the random effects and diagonal, known up to scale for the residual errors) additional to those of the classical methods. The structure of the data is arbitrary as long as the model is estimable in the classical sense. The estimator can detect and contain the effect of outliers in moderately contaminated datasets. Contamination is detected and treated at all levels of variability of the model, e.g., at both the subject and the observation level for a one-way ANOVA model. The estimator’s properties are studied by simulation and two examples. One example implies crossed random effects, for which the known robust methods are not applicable.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 149.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 199.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Free shipping worldwide - see info
Hardcover Book
USD 199.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  • Agostinelli, C., & Yohai, V. J. (2016). Composite robust estimators for linear mixed models. Journal of the American Statistical Association, 111(516), 1764–1774.

    Article  MathSciNet  Google Scholar 

  • Agostinelli, C., & Yohai, V. J. (2019). robustvarComp: Robust estimation for variance component models. R package version 0.1-5.

    Google Scholar 

  • Alqallaf, F., Van Aelst, S., Yohai, V. J., & Zamar, R. H. (2009). Propagation of outliers in multivariate data. The Annals of Statistics, 311–331.

    Google Scholar 

  • Bates, D., Mächler, M., Bolker, B., & Walker, S. (2015). Fitting linear mixed-effects models using lme4. Journal of Statistical Software, 67(1), 1–48.

    Article  Google Scholar 

  • Bates, D. M. (2010). lme4: Mixed-Effects Modeling with R. http://lme4.r-forge.r-project.org/book/.

  • Belenky, G., Wesensten, N. J., Thorne, D. R., Thomas, M. L., Sing, H. C., Redmond, D. P., Russo, M. B., & Balkin, T. J. (2003). Patterns of performance degradation and restoration during sleep restriction and subsequent recovery: A sleep dose-response study. Journal of Sleep Research, 12, 1–12.

    Article  Google Scholar 

  • Chervoneva, I., & Vishnyakov, M. (2011). Constrained S-estimators for linear mixed effects models with covariance components. Statistics in Medicine, 30(14), 1735–1750.

    Article  MathSciNet  Google Scholar 

  • Chervoneva, I., & Vishnyakov, M. (2014). Generalized S-estimators for linear mixed effects models. Statistica Sinica, 1257–1276.

    Google Scholar 

  • Copt, S., & Victoria-Feser, M. (2006). High-breakdown inference for mixed linear models. Journal of the American Statistical Association, 101(473), 292–300.

    Article  MathSciNet  MATH  Google Scholar 

  • Davies, O. L., & Goldsmith, P. L., (Eds.) (1972). Statistical methods in research and production (4th edn.) Hafner.

    Google Scholar 

  • Fellner, W. (1986). Robust estimation of variance components. Technometrics, 28(1), 51–60.

    Article  MathSciNet  MATH  Google Scholar 

  • Fernández, C., & Steel, M. F. (1998). On Bayesian modeling of fat tails and skewness. Journal of the American Statistical Association, 93(441), 359–371.

    MathSciNet  MATH  Google Scholar 

  • Geraci, M., & Bottai, M. (2014). Linear quantile mixed models. Statistics and Computing, 24(3), 461–479.

    Article  MathSciNet  MATH  Google Scholar 

  • Heritier, S., Cantoni, E., Copt, S., & Victoria-Feser, M. (2009). Robust methods in biostatistics. John Wiley & Sons.

    Book  MATH  Google Scholar 

  • King, R., & Anderson, E. (2021). skewt: The skewed Student-t distribution. R package version 1.0.

    Google Scholar 

  • Koller, M. (2013). Robust estimation of linear mixed models. Diss., ETH Zürich, Nr. 20997, 2013.

    Google Scholar 

  • Koller, M. (2016). robustlmm: an R package for robust estimation of linear mixed-effects models. Journal of Statistical Software, 75, 1–24.

    Article  Google Scholar 

  • Koller, M. (2022). Replication code for simulation studies. https://CRAN.R-Project.org/package=robustlmm. Vignette included in R package robustlmm, version 3.0.

  • Koller, M., & Stahel, W. A. (2011). Sharpening Wald-type inference in robust regression for small samples. Computational Statistics & Data Analysis, 55(8), 2504–2515.

    Article  MathSciNet  MATH  Google Scholar 

  • Maronna, R. A., Martin, R. D., Yohai, V. J., & Salibián-Barrera, M. (2019). Robust statistics: Theory and methods (with R). John Wiley & Sons.

    MATH  Google Scholar 

  • Mason, F., Cantoni, E., & Ghisletta, P. (2021). Parametric and semi-parametric bootstrap-based confidence intervals for robust linear mixed models. Methodology, 17(4), 271–295.

    Article  Google Scholar 

  • Miller, J. (1977). Asymptotic properties of maximum likelihood estimates in the mixed model of the analysis of variance. The Annals of Statistics, 5(4), 746–762.

    Article  MathSciNet  MATH  Google Scholar 

  • Papritz, A., Künsch, H. R., Schwierz, C., & Stahel, W. A. (2013). Robust geostatistical analysis of spatial data. In EGU General Assembly Conference Abstracts (p. 14145).

    Google Scholar 

  • Pinheiro, J., Liu, C., & Wu, Y. (2001). Efficient algorithms for robust estimation in linear mixed-effects models using the multivariate t distribution. Journal of Computational and Graphical Statistics, 10(2), 249–276.

    Article  MathSciNet  Google Scholar 

  • R Core Team (2014). R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing.

    Google Scholar 

  • Richardson, A. (1997). Bounded influence estimation in the mixed linear model. Journal of the American Statistical Association, 92(437).

    Google Scholar 

  • Searle, S. R., Casella, G., & McCulloch, C. E. (1992). Variance components. John Wiley & Sons.

    Book  MATH  Google Scholar 

  • Stahel, W. (1987). Estimation of a covariance matrix with location: Asymptotic formulas and optimal B-robust estimators. Journal of Multivariate Analysis, 22(2), 296–312.

    Article  MathSciNet  MATH  Google Scholar 

  • Stahel, W., & Welsh, A. (1997). Approaches to robust estimation in the simplest variance components model. Journal of Statistical Planning and Inference, 57(2), 295–319.

    Article  MathSciNet  MATH  Google Scholar 

Download references

Acknowledgements

The authors would like to thank Kali Tal for providing editorial help with an earlier version of the manuscript.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Manuel Koller .

Editor information

Editors and Affiliations

Appendix

Appendix

1.1 Linear Approximation of Estimated Quantities

In this section, we develop linear approximations to the residuals and the estimated random effects . We use these linear approximations to compute the expected values in the estimating equations as well as the scaling factors τ used in the DAS approach.

Let , , \(\mathbf \psi _e^* = {} \mathbf \psi _e\kern -1pt\left ({\mathbf \varepsilon ^{*}}/{\sigma }\right )\kern +0.5pt/\lambda _e\), \(\mathbf \psi _b^* = \boldsymbol {\Lambda }_b^{-1}\mathbf \psi _b\kern -1pt\left (\mathbf b^{*}/\sigma \right )\kern +0.5pt\), \(\boldsymbol {D}_e =\mathbf {Diag}\kern -1pt\left (\mathbf \psi ^{\prime }_e\kern -1pt\left ({\mathbf \varepsilon ^{*}}/{\sigma }\right )\kern +0.5pt\right )\kern +0.5pt/\lambda _e\), with , , and .

We linearize around β and b, which will be the “true” β and B later on,

Plugging these expressions into the estimating equations (7), divided by λe, and (11) and combining both equations into one yields

where

Using the formula for the inversion of a partitioned matrix, we have

(14)

where

or, equivalently,

Plugging this into (14), we get an approximation for the residuals and for the estimated random effects,

(15)
(16)

with

1.2 Covariance Matrices

The approximations (15) and (16) are used in the computation of covariance matrices. In simpler setups, covariance matrices are calculated on the basis of influence functions IF by integrating . IF is obtained, in the same way as for any M-estimator, from a linear approximation and results proportional to the ψ-function, the factor being the integral of its derivative, \(\lambda = {\mathbf {E}}_{0}\left [\psi ^{\prime }_e\right ]\). Even though we have no rigorous proof for a generalization to our case, we apply this idea here.

The expected values of De and Db are the identity matrices. When these expected values are used as approximations, the matrices M.. and A.. depend only on θ. The calculation of covariance matrices is then straightforward. They will contain the following expectations under the standard normal distribution:

$$\displaystyle \begin{aligned} \gamma_{.}^{(1)} = {\mathbf E}_o\left[z\psi_e\kern-1pt\left(z\right)\kern+0.5pt\right]/\lambda_. \qquad \gamma_{.}^{(2)} = {\mathbf E}_o\left[\psi_e\kern-1pt\left(z\right)\kern+0.5pt^2\right]/\lambda_.^2 \end{aligned}$$

where the dot (.) stands for e or b. The corresponding expressions for the block-diagonal case are and . These are diagonal matrices with entries \(\gamma _{b}^{(1)}\kern -1pt\left (k\kern -1pt\left (j\right )\kern +0.5pt,m_{k\kern -1pt\left (j\right )\kern +0.5pt}\right )\kern +0.5pt\) and \(\gamma _{b}^{(2)}\kern -1pt\left (k\kern -1pt\left (j\right )\kern +0.5pt,m_{k\kern -1pt\left (j\right )\kern +0.5pt}\right )\kern +0.5pt\), respectively, which depend on the dimensions of the blocks k. They are given by

$$\displaystyle \begin{aligned} \gamma_{b}^{(p)}\kern-1pt\left(k,m\right)\kern+0.5pt = m^{-1}{\mathbf E}_{0,m}\left[w_k\kern-1pt\left(u\right)\kern+0.5pt^pu\right]\big/ \lambda_b\kern-1pt\left(k,m\right)\kern+0.5pt \qquad p=1,2 \;. \end{aligned}$$

For fully diagonal Vb, m = 1 and these formulas reduce to \(\gamma _{b}^{(1)}\) and \(\gamma _{b}^{(2)}\).

The covariance matrix of the estimated fixed effects is

(17)

For the derivation of the last equality, we have used the following two identities:

$$\displaystyle \begin{aligned} \begin{array}{rcl} & &\displaystyle \begin{aligned} &\displaystyle {}\boldsymbol{M}_{\beta\beta} \boldsymbol{M}_{XX}\boldsymbol{M}_{\beta\beta} + \boldsymbol{M}_{\beta\beta} \boldsymbol{M}_{XZ} \boldsymbol{M}_{b\beta} = \left(\boldsymbol{M}_{\beta\beta} \boldsymbol{M}_{XX} + \boldsymbol{M}_{\beta b} \boldsymbol{M}_{ZX} \right)\boldsymbol{M}_{\beta\beta} = {} \\ & \left(\boldsymbol{I} + \boldsymbol{M}_{XX}^{-1} \boldsymbol{M}_{XZ} \boldsymbol{M}_{bb} \boldsymbol{M}_{ZX} - \boldsymbol{M}_{XX}^{-1} \boldsymbol{M}_{XZ} \boldsymbol{M}_{bb} \boldsymbol{M}_{ZX} \right)\boldsymbol{M}_{\beta\beta} = \boldsymbol{M}_{\beta\beta} \;,\quad \text{and} \end{aligned} \\[5pt] & &\displaystyle \begin{aligned} &\displaystyle \boldsymbol{M}_{\beta b} \boldsymbol{M}_{ZX} \boldsymbol{M}_{\beta\beta} + \boldsymbol{M}_{\beta b} \boldsymbol{M}_{ZZ} \boldsymbol{M}_{b\beta} = \\ & \boldsymbol{M}_{\beta b} \left(\boldsymbol{M}_{ZX} \boldsymbol{M}_{\beta\beta} - \left(\widehat{\boldsymbol{M}}_{ZZ} - \boldsymbol{D}_b\right) \widehat{\boldsymbol{M}}_{ZZ}^{-1}\boldsymbol{M}_{ZX}\boldsymbol{M}_{\beta\beta} \right) = - \boldsymbol{M}_{\beta b} \boldsymbol{D}_b \boldsymbol{M}_{b\beta}. \end{aligned} \end{array} \end{aligned} $$

For the DAS standardization, we need the covariance matrix of the residuals \(\widehat {\varepsilon }^{*}_i\) and the \(\widehat {b}^{*}_j\),

1.3 Refined Design Adaptive Scale

We first write down the equation determining τe,i for the determination of \(\widehat \sigma \) through (9). The requirement that the ith term in the sum should be zero in expectation translates to the implicit equation

$$\displaystyle \begin{aligned} \int \psi^{(\sigma)}\kern-1pt\left((e-\psi_e\kern-1pt\left(e\right)\kern+0.5pt - r)/\tau_{e,i}\right)\kern+0.5pt \varphi\kern-1pt\left(r/\sigma^{(R)}_i\right)\kern+0.5pt/\sigma^{(R)}_i \mbox dr \varphi\kern-1pt\left(e\right)\kern+0.5pt \mbox de &\\ = \kappa_\sigma\int w^{(\sigma)}\kern-1pt\left((e-\psi_e\kern-1pt\left(e\right)\kern+0.5pt - r)/\tau_{e,i}\right)\kern+0.5pt \varphi\kern-1pt\left(r/\sigma^{(R)}_i\right)\kern+0.5pt/\sigma^{(R)}_i \mbox dr \varphi\kern-1pt\left(e\right)\kern+0.5pt \mbox de & \; \end{aligned}$$

for τi, where \(\psi ^{(\sigma )}\kern -1pt\left (e\right )\kern +0.5pt=e^2w^{(\sigma )}\kern -1pt\left (e\right )\kern +0.5pt\), φ is the standard normal density, \(\sigma ^{(R)}_i\) is the standard deviation of Ri, and κσ is defined below (9). The modification for the variance components θ in the case of diagonal Vb is straightforward.

For random effects with block-diagonal covariance structure, we have

and Tk is determined by

where \(\psi ^{(\eta )}\kern -1pt\left (\mathbf b\right )\kern +0.5pt=\mathbf b\; w^{(\eta )}\kern -1pt\left (\|\mathbf b\|\right )\kern +0.5pt^{1/2}\) and \(\boldsymbol {V}^{(R)}_k\) is the covariance matrix of Rk. (Note that the normalizing constants of the densities cancel.) Integration thus extends over \(2m\kern -1pt\left (k\right )\kern +0.5pt\) dimensions. With this choice of Tk, each term in the sum (12) has approximate expectation zero. To see this, note that . Therefore, multiplying the last equation by from the right and forming the trace proves the result.

The last equation resembles the problem of estimating a robust covariance matrix and can be computed along the same lines.

Rights and permissions

Reprints and permissions

Copyright information

© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this chapter

Check for updates. Verify currency and authenticity via CrossMark

Cite this chapter

Koller, M., Stahel, W.A. (2023). Robust Estimation of General Linear Mixed Effects Models. In: Yi, M., Nordhausen, K. (eds) Robust and Multivariate Statistical Methods. Springer, Cham. https://doi.org/10.1007/978-3-031-22687-8_14

Download citation

Publish with us

Policies and ethics