Abstract
The classical REML estimator for fitting a general linear mixed effects model is modified by bounding the terms appearing in the scoring equations. This leads to a generally applicable robust M-type estimator that we call robust scoring equations estimator. It requires only minor assumptions on the covariance matrices (block diagonal for the random effects and diagonal, known up to scale for the residual errors) additional to those of the classical methods. The structure of the data is arbitrary as long as the model is estimable in the classical sense. The estimator can detect and contain the effect of outliers in moderately contaminated datasets. Contamination is detected and treated at all levels of variability of the model, e.g., at both the subject and the observation level for a one-way ANOVA model. The estimator’s properties are studied by simulation and two examples. One example implies crossed random effects, for which the known robust methods are not applicable.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Agostinelli, C., & Yohai, V. J. (2016). Composite robust estimators for linear mixed models. Journal of the American Statistical Association, 111(516), 1764–1774.
Agostinelli, C., & Yohai, V. J. (2019). robustvarComp: Robust estimation for variance component models. R package version 0.1-5.
Alqallaf, F., Van Aelst, S., Yohai, V. J., & Zamar, R. H. (2009). Propagation of outliers in multivariate data. The Annals of Statistics, 311–331.
Bates, D., Mächler, M., Bolker, B., & Walker, S. (2015). Fitting linear mixed-effects models using lme4. Journal of Statistical Software, 67(1), 1–48.
Bates, D. M. (2010). lme4: Mixed-Effects Modeling with R. http://lme4.r-forge.r-project.org/book/.
Belenky, G., Wesensten, N. J., Thorne, D. R., Thomas, M. L., Sing, H. C., Redmond, D. P., Russo, M. B., & Balkin, T. J. (2003). Patterns of performance degradation and restoration during sleep restriction and subsequent recovery: A sleep dose-response study. Journal of Sleep Research, 12, 1–12.
Chervoneva, I., & Vishnyakov, M. (2011). Constrained S-estimators for linear mixed effects models with covariance components. Statistics in Medicine, 30(14), 1735–1750.
Chervoneva, I., & Vishnyakov, M. (2014). Generalized S-estimators for linear mixed effects models. Statistica Sinica, 1257–1276.
Copt, S., & Victoria-Feser, M. (2006). High-breakdown inference for mixed linear models. Journal of the American Statistical Association, 101(473), 292–300.
Davies, O. L., & Goldsmith, P. L., (Eds.) (1972). Statistical methods in research and production (4th edn.) Hafner.
Fellner, W. (1986). Robust estimation of variance components. Technometrics, 28(1), 51–60.
Fernández, C., & Steel, M. F. (1998). On Bayesian modeling of fat tails and skewness. Journal of the American Statistical Association, 93(441), 359–371.
Geraci, M., & Bottai, M. (2014). Linear quantile mixed models. Statistics and Computing, 24(3), 461–479.
Heritier, S., Cantoni, E., Copt, S., & Victoria-Feser, M. (2009). Robust methods in biostatistics. John Wiley & Sons.
King, R., & Anderson, E. (2021). skewt: The skewed Student-t distribution. R package version 1.0.
Koller, M. (2013). Robust estimation of linear mixed models. Diss., ETH Zürich, Nr. 20997, 2013.
Koller, M. (2016). robustlmm: an R package for robust estimation of linear mixed-effects models. Journal of Statistical Software, 75, 1–24.
Koller, M. (2022). Replication code for simulation studies. https://CRAN.R-Project.org/package=robustlmm. Vignette included in R package robustlmm, version 3.0.
Koller, M., & Stahel, W. A. (2011). Sharpening Wald-type inference in robust regression for small samples. Computational Statistics & Data Analysis, 55(8), 2504–2515.
Maronna, R. A., Martin, R. D., Yohai, V. J., & Salibián-Barrera, M. (2019). Robust statistics: Theory and methods (with R). John Wiley & Sons.
Mason, F., Cantoni, E., & Ghisletta, P. (2021). Parametric and semi-parametric bootstrap-based confidence intervals for robust linear mixed models. Methodology, 17(4), 271–295.
Miller, J. (1977). Asymptotic properties of maximum likelihood estimates in the mixed model of the analysis of variance. The Annals of Statistics, 5(4), 746–762.
Papritz, A., Künsch, H. R., Schwierz, C., & Stahel, W. A. (2013). Robust geostatistical analysis of spatial data. In EGU General Assembly Conference Abstracts (p. 14145).
Pinheiro, J., Liu, C., & Wu, Y. (2001). Efficient algorithms for robust estimation in linear mixed-effects models using the multivariate t distribution. Journal of Computational and Graphical Statistics, 10(2), 249–276.
R Core Team (2014). R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing.
Richardson, A. (1997). Bounded influence estimation in the mixed linear model. Journal of the American Statistical Association, 92(437).
Searle, S. R., Casella, G., & McCulloch, C. E. (1992). Variance components. John Wiley & Sons.
Stahel, W. (1987). Estimation of a covariance matrix with location: Asymptotic formulas and optimal B-robust estimators. Journal of Multivariate Analysis, 22(2), 296–312.
Stahel, W., & Welsh, A. (1997). Approaches to robust estimation in the simplest variance components model. Journal of Statistical Planning and Inference, 57(2), 295–319.
Acknowledgements
The authors would like to thank Kali Tal for providing editorial help with an earlier version of the manuscript.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Appendix
Appendix
1.1 Linear Approximation of Estimated Quantities
In this section, we develop linear approximations to the residuals and the estimated random effects . We use these linear approximations to compute the expected values in the estimating equations as well as the scaling factors τ used in the DAS approach.
Let , , \(\mathbf \psi _e^* = {} \mathbf \psi _e\kern -1pt\left ({\mathbf \varepsilon ^{*}}/{\sigma }\right )\kern +0.5pt/\lambda _e\), \(\mathbf \psi _b^* = \boldsymbol {\Lambda }_b^{-1}\mathbf \psi _b\kern -1pt\left (\mathbf b^{*}/\sigma \right )\kern +0.5pt\), \(\boldsymbol {D}_e =\mathbf {Diag}\kern -1pt\left (\mathbf \psi ^{\prime }_e\kern -1pt\left ({\mathbf \varepsilon ^{*}}/{\sigma }\right )\kern +0.5pt\right )\kern +0.5pt/\lambda _e\), with , , and .
We linearize around β and b∗, which will be the “true” β and B∗ later on,
Plugging these expressions into the estimating equations (7), divided by λe, and (11) and combining both equations into one yields
where
Using the formula for the inversion of a partitioned matrix, we have
where
or, equivalently,
Plugging this into (14), we get an approximation for the residuals and for the estimated random effects,
with
1.2 Covariance Matrices
The approximations (15) and (16) are used in the computation of covariance matrices. In simpler setups, covariance matrices are calculated on the basis of influence functions IF by integrating . IF is obtained, in the same way as for any M-estimator, from a linear approximation and results proportional to the ψ-function, the factor being the integral of its derivative, \(\lambda = {\mathbf {E}}_{0}\left [\psi ^{\prime }_e\right ]\). Even though we have no rigorous proof for a generalization to our case, we apply this idea here.
The expected values of De and Db are the identity matrices. When these expected values are used as approximations, the matrices M.. and A.. depend only on θ. The calculation of covariance matrices is then straightforward. They will contain the following expectations under the standard normal distribution:
where the dot (.) stands for e or b. The corresponding expressions for the block-diagonal case are and . These are diagonal matrices with entries \(\gamma _{b}^{(1)}\kern -1pt\left (k\kern -1pt\left (j\right )\kern +0.5pt,m_{k\kern -1pt\left (j\right )\kern +0.5pt}\right )\kern +0.5pt\) and \(\gamma _{b}^{(2)}\kern -1pt\left (k\kern -1pt\left (j\right )\kern +0.5pt,m_{k\kern -1pt\left (j\right )\kern +0.5pt}\right )\kern +0.5pt\), respectively, which depend on the dimensions of the blocks k. They are given by
For fully diagonal Vb, m = 1 and these formulas reduce to \(\gamma _{b}^{(1)}\) and \(\gamma _{b}^{(2)}\).
The covariance matrix of the estimated fixed effects is
For the derivation of the last equality, we have used the following two identities:
For the DAS standardization, we need the covariance matrix of the residuals \(\widehat {\varepsilon }^{*}_i\) and the \(\widehat {b}^{*}_j\),
1.3 Refined Design Adaptive Scale
We first write down the equation determining τe,i for the determination of \(\widehat \sigma \) through (9). The requirement that the ith term in the sum should be zero in expectation translates to the implicit equation
for τi, where \(\psi ^{(\sigma )}\kern -1pt\left (e\right )\kern +0.5pt=e^2w^{(\sigma )}\kern -1pt\left (e\right )\kern +0.5pt\), φ is the standard normal density, \(\sigma ^{(R)}_i\) is the standard deviation of Ri, and κσ is defined below (9). The modification for the variance components θℓ in the case of diagonal Vb is straightforward.
For random effects with block-diagonal covariance structure, we have
and Tk is determined by
where \(\psi ^{(\eta )}\kern -1pt\left (\mathbf b\right )\kern +0.5pt=\mathbf b\; w^{(\eta )}\kern -1pt\left (\|\mathbf b\|\right )\kern +0.5pt^{1/2}\) and \(\boldsymbol {V}^{(R)}_k\) is the covariance matrix of Rk. (Note that the normalizing constants of the densities cancel.) Integration thus extends over \(2m\kern -1pt\left (k\right )\kern +0.5pt\) dimensions. With this choice of Tk, each term in the sum (12) has approximate expectation zero. To see this, note that . Therefore, multiplying the last equation by from the right and forming the trace proves the result.
The last equation resembles the problem of estimating a robust covariance matrix and can be computed along the same lines.
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this chapter
Cite this chapter
Koller, M., Stahel, W.A. (2023). Robust Estimation of General Linear Mixed Effects Models. In: Yi, M., Nordhausen, K. (eds) Robust and Multivariate Statistical Methods. Springer, Cham. https://doi.org/10.1007/978-3-031-22687-8_14
Download citation
DOI: https://doi.org/10.1007/978-3-031-22687-8_14
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-22686-1
Online ISBN: 978-3-031-22687-8
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)