Abstract
In this paper, we develop robust joint estimation of the mean and covariance for the regression model of longitudinal data within the framework of generalized estimating equations (GEE). The proposed approach integrates robust methods with joint mean–covariance regression modeling. Robust generalized estimating equations using bounded scores and leverage-based weights are employed for both the mean and the covariance to achieve robustness against outliers. The resulting estimators are shown to be consistent and asymptotically normally distributed. Simulation studies are conducted to investigate the effectiveness of the proposed method. As expected, the robust method outperforms its non-robust version under contamination. Finally, we illustrate the method by analyzing a hormone data set. By downweighting the potential outliers, the proposed method not only shifts the estimates in the mean model, but also shrinks the range of the innovation variances, leading to a more reliable estimate of the covariance matrix.
References
Cantoni, E. (2004). A robust approach to longitudinal data analysis. The Canadian Journal of Statistics, 32, 169–180.
Croux, C., Gijbels, I., Prosdocimi, I. (2012). Robust estimation of mean and dispersion functions in extended generalized additive models. Biometrics, 68, 31–44.
Daniels, M., Zhao, Y. (2003). Modelling the random effects covariance matrix in longitudinal data. Statistics in Medicine, 22, 1631–1647.
Fan, J., Huang, T., Li, R. (2007). Analysis of longitudinal data with semiparametric estimation of covariance function. Journal of the American Statistical Association, 102, 632–641.
Fan, J., Wu, Y. (2008). Semiparametric estimation of covariance matrices for longitudinal data. Journal of the American Statistical Association, 103, 1520–1533.
Fan, J., Zhang, J. T. (2000). Two-step estimation of functional linear models with application to longitudinal data. Journal of the Royal Statistical Society: Series B (Statistical Methodology), 62, 303–322.
Fung, W. K., Zhu, Z. Y., He, X. (2002). Influence diagnostics and outlier tests for semiparametric mixed models. Journal of the Royal Statistical Society: Series B (Statistical Methodology), 64, 565–579.
He, X., Fung, W. K., Zhu, Z. Y. (2005). Robust estimation in generalized partial linear models for clustered data. Journal of the American Statistical Association, 100, 1176–1184.
Leng, C., Zhang, W., Pan, J. (2010). Semiparametric mean-covariance regression analysis for longitudinal data. Journal of the American Statistical Association, 105, 181–193.
Levina, E., Rothman, A. J., Zhu, J. (2008). Sparse estimation of large covariance matrices via a nested Lasso penalty. Annals of Applied Statistics, 2, 245–263.
Liang, K. Y., Zeger, S. L. (1986). Longitudinal data analysis using generalized linear models. Biometrika, 73, 13–22.
Mao, J., Zhu, Z. Y., Fung, W. K. (2011). Joint estimation of mean-covariance model for longitudinal data with basis function approximations. Computational Statistics and Data Analysis, 55, 983–992.
McCullagh, P. (1983). Quasi-likelihood functions. Annals of Statistics, 11, 59–67.
Pan, J., Mackenzie, G. (2003). On modelling mean-covariance structures in longitudinal studies. Biometrika, 90, 239–244.
Pourahmadi, M. (1999). Joint mean-covariance models with applications to longitudinal data: unconstrained parameterisation. Biometrika, 86, 677–690.
Pourahmadi, M. (2000). Maximum likelihood estimation of generalised linear models for multivariate normal covariance matrix. Biometrika, 87, 425–435.
Qin, G. Y., Zhu, Z. Y. (2007). Robust estimation in generalized semiparametric mixed models for longitudinal data. Journal of Multivariate Analysis, 98, 1658–1683.
Qin, G. Y., Zhu, Z. Y., Fung, W. K. (2009). Robust estimation of covariance parameters in partial linear model for longitudinal data. Journal of Statistical Planning and Inference, 139, 558–570.
Qu, A., Lindsay, B., Li, B. (2000). Improving generalised estimating equations using quadratic inference functions. Biometrika, 87, 823–836.
Sinha, S. (2004). Robust analysis of generalized linear mixed models. Journal of the American Statistical Association, 99, 451–460.
Wang, N. (2003). Marginal nonparametric kernel regression accounting for within-subject correlation. Biometrika, 90, 29–42.
Wang, N., Carroll, R. J., Lin, X. (2005a). Efficient semiparametric marginal estimation for longitudinal/clustered data. Journal of the American Statistical Association, 100, 147–157.
Wang, Y. G., Lin, X., Zhu, M. (2005b). Robust estimation functions and bias correction for longitudinal data analysis. Biometrics, 61, 684–691.
Wu, W., Pourahmadi, M. (2003). Nonparametric estimation of large covariance matrices of longitudinal data. Biometrika, 90, 831–844.
Ye, H., Pan, J. (2006). Modelling covariance structures in generalized estimating equations for longitudinal data. Biometrika, 93, 927–941.
Zhang, D. W., Lin, X. H., Raz, J., Sowers, M. F. (1998). Semiparametric stochastic mixed models for longitudinal data. Journal of the American Statistical Association, 93, 710–719.
Acknowledgments
The authors are grateful to the reviewers, the Associate Editor, and the Co-Editor for their insightful comments and suggestions which have improved the manuscript significantly.
Appendix
1.1 Proofs
Regularity conditions:
- A1. We assume that the dimensions \(p,\,q\) and \(d\) of the covariates \(x_{ij}\), \(z_{ij}\) and \(z_{ijk}\) are fixed and that \(\{n_i\}\) is a bounded sequence of positive integers. The first four moments of \(y_{ij}\) exist.
- A2. The parameter space \(\varTheta \) of \((\beta ^{\prime },\gamma ^{\prime },\lambda ^{\prime })^{\prime }\) is a compact subset of \(R^{p+q+d}\), and the true parameter value \((\beta ^{\prime }_{0},\gamma ^{\prime }_{0},\lambda ^{\prime }_{0})^{\prime }\) is in the interior of \(\varTheta \).
- A3. The covariates \(z_{ijk}\) and \(z_{ij}\) and the matrices \(W_{i}^{-1}\) are all bounded, meaning that all of their elements are bounded. The function \(\dot{g}^{-1}(\cdot )\) has bounded second derivatives.
Proof of Theorem 1
For illustration we only give the proof that \(\hat{\beta }_m\rightarrow \beta _0\) almost surely; the proofs for \(\hat{\gamma }_m\) and \(\hat{\lambda }_m\) are similar. According to McCullagh (1983), we have
On the other hand, the expectation and variance matrix of \(U_{1i}=X_i^{\prime }\varDelta _i(V^{\beta }_i)^{-1}h^{\beta }_i(\mu _i(\beta ))\) at \(\beta =\beta _0\) are given by \(E_0(U_{1i})=0\) and
where \(G_i^{0}=\text{diag}\{\dot{g}^{-1}(x_{i1}^{\prime }\beta _0),\dots , \dot{g}^{-1}(x^{\prime }_{in_i}\beta _0)\}\) is an \(n_i\times n_i\) diagonal matrix.
Since \(V_i^{\beta }=A_i^{-1/2}\varSigma _i\) and \(\varSigma _i^{-1}=\varPhi ^{\prime }_iD^{-1}_i\varPhi _i\), the variance can be further written as \(\text{var}_0(U_{1i})=(G^0_iX_i^{\prime }X_i)^{\prime }\varPhi _i(D_i^{-1}A_i^{-1/2})\varPhi ^{\prime }_i\varGamma ^{\beta }_i(G^0_iX_i^{\prime }X_i)\). Condition A3 above implies that there exists a constant \(\kappa _0\) such that \(\text{var}_0(U_{1i})\le \kappa _0 1_{p\times p}\) for all \(i\) and all \(\theta \in \varTheta \), where \(1_{p\times p}\) is the \(p\times p\) matrix with all elements being 1, meaning that all elements of \(\text{var}_0(U_{1i})\) are bounded by \(\kappa _0\). Thus \(\sum ^{\infty }_{i=1}\text{var}_0(U_{1i})/i^2<\infty \). By Kolmogorov's strong law of large numbers we know that
almost surely as \(m\rightarrow \infty \). In the same manner it can be shown that
is a bounded matrix. This leads to \(\hat{\beta }_m-\beta _0\rightarrow 0 \) almost surely as \(m\rightarrow \infty \). The proof is complete. \(\square \)
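The identity \(\varSigma _i^{-1}=\varPhi ^{\prime }_iD^{-1}_i\varPhi _i\) used above is Pourahmadi's (1999) modified Cholesky decomposition, with \(\varPhi _i\) unit lower triangular and \(D_i\) the diagonal matrix of innovation variances. The factorization can be checked numerically as follows (an illustrative sketch via NumPy's standard Cholesky routine, not the paper's code):

```python
import numpy as np

def modified_cholesky(Sigma):
    """Pourahmadi's modified Cholesky: Sigma^{-1} = Phi' D^{-1} Phi,
    with Phi unit lower triangular and D diagonal (innovation variances)."""
    L = np.linalg.cholesky(Sigma)   # Sigma = L L', L lower triangular
    d = np.diag(L)
    T = L / d                       # unit lower triangular: Sigma = T diag(d^2) T'
    Phi = np.linalg.inv(T)          # hence Sigma^{-1} = Phi' D^{-1} Phi
    D = np.diag(d ** 2)
    return Phi, D
```

The below-diagonal entries of \(\varPhi _i\) are (up to sign) the generalized autoregressive coefficients, and the diagonal of \(D_i\) holds the innovation variances; both are unconstrained, which is what makes the covariance parameters amenable to regression modeling.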
Proof of Theorem 2
First we introduce some notation. Define
Next we prove the following Lemma:
Lemma
Under conditions (A1)–(A3),
Define
By conditions (A1)–(A3), \(\Psi (\xi )\) and \(U_1\) have the same root for \(\xi \), and the solution of \(\Psi \) is \(\tilde{\xi }\). Following the proof of Theorem 1 in He et al. (2005), we immediately obtain that
where \(L\) is a sufficiently large number. By Brouwer’s fixed-point theorem, (11) is verified. We can prove (12) and (13) similarly. \(\square \)
By the Lemma, we only need to show the asymptotic normality of \((\tilde{\xi }^{\prime },\ \tilde{\eta }^{\prime },\ \tilde{\zeta }^{\prime })^{\prime }/\sqrt{m}\). This is equivalent to the asymptotic normality of \((\tilde{U}_{1}^{\prime },\ \tilde{U}_{2}^{\prime },\ \tilde{U}_{3}^{\prime })/\sqrt{m}\). Note that conditions (A1)–(A3) imply that
for any \(\varsigma \in R^{p+K},\ \omega \in R^{q}\, \text{ and}\, \phi \in R^{d+K^{\prime }}\), where \(\kappa \) is a constant independent of \(i\).
Furthermore, we have
Hence the asymptotic normality of \((\tilde{U}_{1}^{\prime },\ \tilde{U}_{2}^{\prime },\ \tilde{U}_{3}^{\prime })/\sqrt{m}\) follows from the multivariate Liapounov central limit theorem. Therefore,
The proof of Theorem 2 is complete. \(\square \)
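For the reader's convenience, the Liapounov condition invoked above can be stated as follows; this is the standard scalar form for independent but non-identically distributed summands, not quoted from the paper:

```latex
% Liapounov CLT (scalar form): U_1, U_2, ... independent, E U_i = 0,
% s_m^2 = sum_{i<=m} var(U_i).  If the moment condition below holds for
% some delta > 0, then s_m^{-1} sum_{i<=m} U_i converges in distribution
% to N(0,1).  The multivariate case follows by the Cramer-Wold device.
\[
  \lim_{m\to\infty}\frac{1}{s_m^{2+\delta}}\sum_{i=1}^{m}
  \mathrm{E}\,|U_i|^{2+\delta}=0
  \;\Longrightarrow\;
  \frac{1}{s_m}\sum_{i=1}^{m}U_i \xrightarrow{\ d\ } N(0,1),
  \qquad s_m^{2}=\sum_{i=1}^{m}\mathrm{var}(U_i).
\]
```

Condition (A1) (existence of the first four moments of \(y_{ij}\)) is what supplies the \((2+\delta )\)-th moment bound here.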
Zheng, X., Fung, W.K. & Zhu, Z. Robust estimation in joint mean–covariance regression model for longitudinal data. Ann Inst Stat Math 65, 617–638 (2013). https://doi.org/10.1007/s10463-012-0383-8