Skip to main content

Advertisement

Log in

Efficient robust estimation for single-index mixed effects models with missing observations

  • Regular Article
  • Published:
Statistical Papers Aims and scope Submit manuscript

Abstract

In this paper, we study the efficient robust estimation and empirical likelihood for a single-index mixed effects model with a subset of covariates and response missing at random. Three efficient robust estimators and empirical likelihood ratios for index coefficients are constructed using weighted, imputed and weighted-imputed method, their asymptotic properties are proved. Our results show that the three estimators are asymptotically equivalent, and a weighted-imputed empirical log-likelihood ratio is asymptotically chi-squared. An important feature of our methods is their ability to handle missing response and/or partially missing covariates. Some simulation studies and a real data example indicate that our methods have fine performance in finite sample, and are available in practice.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5

Similar content being viewed by others

References

  • Arnold SF (1981) The theory of linear models and multivariate analysis. Wiley, New York

    Google Scholar 

  • Carroll RJ, Fan J, Gijbels I, Wand MP (1997) Generalized partially linear single-index models. J Am. Stat. Assoc. 92:477–489

    Article  MathSciNet  Google Scholar 

  • Chen XH, Hong H, Tarozzi A (2008) Semiparametric efficiency in GMM models with auxiliary data. Ann Stat 36:808–843

    Article  MathSciNet  Google Scholar 

  • Craven P, Wahba G (1979) Smoothing and noisy data with spline functions: estimating the correct degree of smoothing by the method of generalized cross-validation. Numer Math 31:377–403

    Article  MathSciNet  Google Scholar 

  • Fan JQ, Gijbels I (1996) Local polynomial modeling and its applications. Chapman and Hall, London

    Google Scholar 

  • Guo X, Niu CZ, Yang YP, Xu WL (2015) Empirical likelihood for single index model with missing covariates at random. Statistics 49:588–601

    Article  MathSciNet  Google Scholar 

  • Liang H, Wang SJ, Robins JM, Carroll RJ (2004) Estimation in partially linear models with missing covariates. J Am Stat Assoc 99:357–367

    Article  MathSciNet  Google Scholar 

  • Liang KY, Zeger SL (1986) Longitudinal data analysis using generalized linear model. Biometrika 73:13–22

    Article  MathSciNet  Google Scholar 

  • Lin XH, Carroll RJ (2001) Semiparametric regression for clustered data using generalized estimating equations. J Am Stat Assoc 96:1045–1056

    Article  MathSciNet  Google Scholar 

  • Masry E, Tjøstheim D (1995) Nonparametric estimation and identification of nonlinear ARCH time series: Strong convergence and asymptotic normality. Econometric Theory 11:258–289

    Article  MathSciNet  Google Scholar 

  • Owen AB (1990) Empirical likelihood ratio confidence regions. Ann Stat 18:90–120

    Article  MathSciNet  Google Scholar 

  • Pang Z, Xue LG (2012) Estimation for single-index models with random effects. Comput Stat Data Anal 56:1837–1853

    Article  MathSciNet  Google Scholar 

  • Qin J, Lawless J (1994) Empirical likelihood and general estimating equations (in likelihood and related topics). Ann Stat 22:300–325

    Article  Google Scholar 

  • Qin J, Zhang B, Leung DHY (2009) Empirical likelihood in missing data problems. J Am Stat Assoc 104:1492–1503

    Article  MathSciNet  Google Scholar 

  • Robins JM, Rotnitzky A, Zhao LP (1994) Estimation of regression-coefficients when some regressors are not always observed. J Am Stat Assoc 89:846–866

    Article  MathSciNet  Google Scholar 

  • Ruckstuhl AF, Welsh AH, Carroll RJ (2000) Nonparametric function estimation of the relationship between two repeatedly measured variables. Stat Sin 10:51–71

    MathSciNet  Google Scholar 

  • Stute W, Zhu LX (2005) Nonparametric checks for single-index models. Ann Stat 33:1048–1083

    Article  MathSciNet  Google Scholar 

  • Wang JL, Xue LG, Zhu LX, Chong YS (2010) Estimation for a partial-linear single-index model. Ann Stat 38:246–274

    Article  MathSciNet  Google Scholar 

  • Wei Y, Ma Y, Carroll R (2012) Multiple imputation in quantile regression. Biometrika 99:423–438

    Article  MathSciNet  PubMed  PubMed Central  Google Scholar 

  • Welsh AH (1989) On M-processes and M-estimation. Ann Stat. 17 337–361. [Correction(1990) 18 1500.]

  • Wu H, Wu L (2001) A multiple imputation method for missing covariates in nonlinear mixed-effect models, with application to HIV dynamics. Stat Med 20:1755–1769

    Article  CAS  PubMed  Google Scholar 

  • Wu L (2002) A joint model for nonlinear mixed-effects models with censoring and covariates measured with error, with application to AIDS studies. J Am Stat Assoc 97:955–964

    Article  Google Scholar 

  • Xue LG (2013) Estimation and empirical likelihood for single-index models with missing data in the covariates. Comput Stat Data Anal 60:82–97

    Article  MathSciNet  Google Scholar 

  • Xue LG, Lian H (2016) Empirical likelihood for single-index models with responses missing at random. Sci China Math 59:1187–1207

    Article  MathSciNet  Google Scholar 

  • Xue LG, Zhang JH (2020) Empirical likelihood for single-index models with missing observations. Comput Stat Data Anal 44, 106877

Download references

Acknowledgements

The authors are very grateful to the chief editor and the associate editor for their strong support and help! Thanks to the two anonymous reviewers for their careful review! Many useful opinions and suggestions have greatly improved the quality of this paper. This work was supported by the National Natural Science Foundation of China (11971001). The data set of real example is from an AIDS clinical trial group (ACTG) study.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Liugen Xue.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Appendix. Proofs of Theorems

Appendix. Proofs of Theorems

To facilitate analysis, we introduce several lemmas, the proofs of which are given in the supplementary material.

Lemma 1

Suppose that conditions (C1)–(C8) hold. Then

$$\begin{aligned} \widehat{Q}_\textrm{L}(\beta ^{(r)}) = Q_\textrm{WI}(\beta _0^{(r)})- A(\beta ^{(r)}-\beta _0^{(r)}) + o_P(n^{-1/2}) \end{aligned}$$

when L = W, I and WI respectively, uniformly for \(\beta ^{(r)}\in \mathcal{B}_n^*\) with \(\mathcal{B}_n^*=\{\beta ^{(r)}|\,\Vert \beta ^{(r)}-\beta _0^{(r)}\Vert \le cn^{-1/2}\}\) for a constant \(c>0\), where \(\widehat{Q}_\textrm{W}(\beta ^{(r)})\), \(\widehat{Q}_\textrm{I}(\beta ^{(r)})\), \(\widehat{Q}_\textrm{WI}(\beta ^{(r)})\) and A are defined in (2.6) and (2.12) and (3.2) respectively,

$$\begin{aligned} Q_\textrm{WI}(\beta ^{(r)}) = \frac{1}{n}\sum _{i=1}^n\Bigg \{\frac{\delta _i}{\pi (V_i)}\xi _i(\beta ^{(r)}) + \left( 1-\frac{\delta _i}{\pi (V_i)}\right) m(V_i;\beta ^{(r)})\Bigg \}, \end{aligned}$$

\(\xi _i(\beta ^{(r)})\) is defined in (3.4) and \(m(V_i;\beta ^{(r)})=E\{\xi _i(\beta ^{(r)})|V_i\}\).

Lemma 2

Suppose that conditions (C1)–(C8) hold. Then

$$\begin{aligned} \sqrt{n}\widehat{Q}_\textrm{L}(\beta _0^{(r)}){\mathop {\longrightarrow }\limits ^{D}}N(0,B) \end{aligned}$$

when L = W, I and WI respectively, where \(\widehat{Q}_\textrm{W}(\beta _0^{(r)})\), \(\widehat{Q}_\textrm{I}(\beta _0^{(r)})\) and \(\widehat{Q}_\textrm{WI}(\beta _0^{(r)})\) are defined in (2.6) and (2.12) respectively, and B is defined in (3.3).

Lemma 3

Suppose that conditions (C1)–(C8) hold. Then

$$\begin{aligned}{} & {} \quad \mathrm{(a)}~~\frac{1}{n}\sum _{i=1}^n\hat{\eta }_{i,\mathrm L}(\beta _0^{(r)})\hat{\eta }_{i,\mathrm L}^T(\beta _0^{(r)}){\mathop {\longrightarrow }\limits ^{P}}C_\textrm{L}, \nonumber \\{} & {} \quad \mathrm{(b)}~~\frac{1}{n}\sum _{i=1}^n\hat{\eta }_{i,\mathrm WI}(\beta _0^{(r)})\hat{\eta }_{i,\mathrm WI}^T(\beta _0^{(r)}){\mathop {\longrightarrow }\limits ^{P}}B \end{aligned}$$

when L = W and I respectively, where B, \(C_\textrm{W}\) and \(C_\textrm{I}\) are defined in (3.3), (3.5) and (3.6) respectively, and \(\hat{\eta }_{i,\mathrm W}(\cdot )\), \(\hat{\eta }_{i,\mathrm I}(\cdot )\) and \(\hat{\eta }_{i,\mathrm WI}(\cdot )\) are defined in (2.9), (2.13) and (2.14) respectively.

Lemma 4

Suppose that conditions (C1)–(C8) hold. Then

$$\begin{aligned} \displaystyle \max _{1\le i\le n}|\hat{\eta }_{i,\mathrm L}(\beta _0^{(r)})| = o_P(n^{1/2}) \end{aligned}$$

when L = W, I and WI respectively.

Proof of Theorem 1

We consider the asymptotic normality of \(\hat{\beta }_\textrm{L}\) when L= W, I and WI respectively. The proof is divided into two steps: Step (I) provides the existence of the estimator \(\hat{\beta }_\textrm{L}^{(r)}\) of \(\beta _0^{(r)}\), and step (II) proves the asymptotic normality of \(\hat{\beta }_\textrm{L}^{(r)}\). \(\square \)

(I) Proof of existence. We prove the following fact: Under conditions (C1)–(C8) and with probability one there exists an estimator of \(\beta _0^{(r)}\) solving the estimating Eqs. (2.6) or (2.12) in \(\mathcal{B}_n^{**}\), where \(\mathcal{B}_n^{**}=\big \{\beta ^{(r)}|\, \Vert \beta ^{(r)}-\beta _0^{(r)}\Vert =Mn^{-1/2}\big \}\) for some constant M such that \(0<M<\infty \). In fact, by Lemma 1, we have

$$\begin{aligned} \widehat{Q}_\textrm{L}(\beta ^{(r)}) = Q_\textrm{WI}(\beta _0^{(r)}) - A(\beta ^{(r)}-\beta _0^{(r)}) + o_P(n^{-1/2}) \end{aligned}$$
(A.1)

when L = W, I and WI respectively, uniformly for \(\beta ^{(r)}\in \mathcal{B}_n^{**}\), where \(Q_\textrm{WI}(\beta ^{(r)})\) is defined in Lemma 1. Therefore, we have

$$\begin{aligned}&n(\beta ^{(r)}-\beta _0^{(r)})\widehat{Q}_\textrm{L}(\beta ^{(r)}) \nonumber \\&\quad = n(\beta ^{(r)}-\beta _0^{(r)})Q_\textrm{WI}(\beta _0^{(r)}) - n(\beta ^{(r)}-\beta _0^{(r)})A(\beta ^{(r)}-\beta _0^{(r)}) + o_P(1). \end{aligned}$$
(A.2)

We note that term (A.2) is dominated by the term \(\sim M^2\) because \(\sqrt{n}\Vert \beta ^{(r)}-\beta _0^{(r)}\Vert =M\), whereas \(|n(\beta ^{(r)}-\beta _0^{(r)})^TQ_\textrm{WI}(\beta _0^{(r)})|=MO_P(1)\), and \(n(\beta ^{(r)}-\beta _0^{(r)})A(\beta ^{(r)}-\beta _0^{(r)})\sim M^2\). So, for any given \(\eta >0\), if M is chosen large enough, then it will follows that \(n(\beta ^{(r)}-\beta _0^{(r)})\widehat{Q}_\textrm{L}(\beta ^{(r)})<0\) on an event with probability \(1-\eta \). From the arbitrariness of \(\eta \), we can prove the existence of the estimator of \(\beta _0^{(r)}\) in \(\mathcal{B}_n^{**}\) as in the proof of Theorem 5.1 of Welsh (1989). The details are omitted.

(II) Proof of asymptotic normality. From step (I) we find that \(\hat{\beta }_\textrm{L}^{(r)}\) is a solution in \(\mathcal{B}_n^{**}\) to the equation \(\widehat{Q}_\textrm{L}(\beta ^{(r)})=0\). That is, \(\widehat{Q}_\textrm{L}(\hat{\beta }_\textrm{L}^{(r)})=0\). From (A.1), we have

$$\begin{aligned} 0=Q_\textrm{WI}(\beta _0^{(r)}) - A(\hat{\beta }_\textrm{L}^{(r)} - \beta _0^{(r)}) + o_P(n^{-1/2}), \end{aligned}$$

and hence

$$\begin{aligned} \sqrt{n}(\hat{\beta }_\textrm{L}^{(r)} - \beta _0^{(r)}) = A^{-1}\sqrt{n}Q_\textrm{WI}(\beta _0^{(r)}) + o_P(1) \end{aligned}$$

when L = W, I and WI respectively. Theorem 2 follows from this, Lemma 2, Central Limit Theorem and Slutsky’s theorem. \(\square \)

Proof of Corollary 1

It is easy to prove

$$\begin{aligned} \hat{\beta }_\textrm{L} - \beta _0 = J_{\hat{\beta }_\textrm{L}^{(r)}}\big (\hat{\beta }_\textrm{L}^{(r)} - \beta _0^{(r)}\big ) \end{aligned}$$

when L = W, I and WI respectively. Therefore, Corollary 1 can be derived from Theorem 1. \(\square \)

Proof of Corollary 2

Note that

$$\begin{aligned} \hat{\beta }_\textrm{L}^T\beta _0 = \big (\hat{\beta }_\textrm{L} - \beta _0\big )^T\beta _0 + \beta _0^T\beta _0 \end{aligned}$$

when L = W, I and WI respectively. Therefore, from Theorem 1 we obtain

$$\begin{aligned} \hat{\beta }_\textrm{L}^T\beta _0 - 1 = O_P(n^{-1/2}) \end{aligned}$$

when L = W, I and WI respectively. This, together with the above formula, proves Corollary 2. \(\square \)

Proof of Theorem 2

By the Lagrange multiplier method, \(\mathcal{R}_\textrm{L}(\beta ^{(r)}_0)\) can be represented as

$$\begin{aligned} \mathcal{R}_\textrm{L}(\beta ^{(r)}_0)=2\sum _{i=1}^n\log (1+\lambda ^T\hat{\eta }_{i,\mathrm L}(\beta ^{(r)}_0)) \end{aligned}$$
(A.3)

when L = W, I and WI respectively, where \(\lambda =\lambda (\beta ^{(r)}_0)\) is a \((p-1)\times 1\) vector given as the solution to

$$\begin{aligned} \sum _{i=1}^n\frac{\hat{\eta }_{i,\mathrm L}(\beta ^{(r)}_0)}{1+\lambda ^T\hat{\eta }_{i,\mathrm L}(\beta ^{(r)}_0)}=0. \end{aligned}$$
(A.4)

By Lemmas 2 and 3, and using the same arguments as are used in the proof of (2.15) in Owen (1990), we can show that

$$\begin{aligned} \lambda =O_P(n^{-1/2}). \end{aligned}$$
(A.5)

Applying the Taylor expansion to (A.3), and invoking Lemmas 3–4 and (A.5), we get that

$$\begin{aligned} \mathcal{R}_\textrm{L}(\beta ^{(r)}_0) = 2 \sum _{i=1}^n[\lambda ^T\hat{\eta }_{i,\mathrm L}(\beta ^{(r)}_0) - \{\lambda ^T\hat{\eta }_{i,\mathrm L}(\beta ^{(r)}_0)\}^2/2] +o_P(1) \end{aligned}$$
(A.6)

when L = W, I and WI respectively. By (A.4), it follows that

$$\begin{aligned} 0 = \sum _{i=1}^n\frac{\hat{\eta }_{i,\mathrm L}(\beta ^{(r)}_0)}{1+\lambda ^T\hat{\eta }_{i,\mathrm L}(\beta ^{(r)}_0)}= & {} \sum _{i=1}^n \hat{\eta }_{i,\mathrm L}(\beta ^{(r)}_0) - \sum _{i=1}^n\hat{\eta }_{i,\mathrm L}(\beta ^{(r)}_0)\hat{\eta }_{i,\mathrm L}^T(\beta ^{(r)}_0)\lambda \\{} & {} + \sum _{i=1}^n\frac{\hat{\eta }_{i,\mathrm L}(\beta ^{(r)}_0)\{\lambda ^T\hat{\eta }_{i,\mathrm L}(\beta ^{(r)}_0)\}^2}{1+\lambda ^T\hat{\eta }_{i,\mathrm L}(\beta ^{(r)}_0)}. \end{aligned}$$

This, together with Lemmas 3 and 4 as well as (A.5), proves that

$$\begin{aligned} \sum _{i=1}^n \{\lambda ^T\hat{\eta }_{i,\mathrm L}(\beta ^{(r)}_0)\}^2 = \sum _{i=1}^n\lambda ^T\hat{\eta }_{i,\mathrm L}(\beta ^{(r)}_0)+o_P(1) \end{aligned}$$

and

$$\begin{aligned} \lambda =\left\{ \sum _{i=1}^n\hat{\eta }_{i,\mathrm L}(\beta ^{(r)}_0)\hat{\eta }_{i,\mathrm L}^T(\beta ^{(r)}_0)\right\} ^{-1} \sum _{i=1}^n \hat{\eta }_{i,\mathrm L}(\beta ^{(r)}_0) + o_P(n^{-1/2}). \end{aligned}$$

Therefore, from (A.6) we have

$$\begin{aligned} \mathcal{R}_\textrm{L}(\beta ^{(r)}_0) =&\left\{ \frac{1}{\sqrt{n}}\sum _{i=1}^n\hat{\eta }_{i,\mathrm L}(\beta ^{(r)}_0)\right\} ^T \left\{ \frac{1}{n}\sum _{i=1}^n\hat{\eta }_{i,\mathrm L}(\beta ^{(r)}_0)\hat{\eta }_{i,\mathrm L}^T(\beta ^{(r)}_0) \right\} ^{-1}\nonumber \\&\times \left\{ \frac{1}{\sqrt{n}}\sum _{i=1}^n\hat{\eta }_{i,\mathrm L}(\beta ^{(r)}_0)\right\} +o_P(1) \end{aligned}$$
(A.7)

when L = W, I and WI respectively. This, together with Slutsky’s theorem and Lemmas 2 and 3, proves (c) of Theorem 2.

Now, we prove (a) of Theorem 2 only. Similarly, (b) of Theorem 2 can also be proved. From (A.7) and (a) of Lemma 3 we obtain

$$\begin{aligned} \mathcal{R}_\textrm{W}(\beta ^{(r)}_0) = \left\{ B^{-1/2}\frac{1}{\sqrt{n}}\sum _{i=1}^n\hat{\eta }_{i,\mathrm W}(\beta ^{(r)}_0)\right\} ^TD_1 \left\{ B^{-1/2}\frac{1}{\sqrt{n}}\sum _{i=1}^n\hat{\eta }_{i,\mathrm W}(\beta ^{(r)}_0)\right\} +o_P(1), \end{aligned}$$

where \(D_1=B^{1/2}C_\textrm{W}^{-1}B^{1/2}\). Let \(D_2=\text{ diag }(w_1^*,\ldots ,w_{p-1}^*)\), where \(w_i^*,\ldots ,w_{p-1}^*\) are the eigenvalues of \(C_\textrm{W}^{-1}B\). Note that \(D_1\) and \(C_\textrm{W}^{-1}B\) have the same eigenvalues. Then there exists an orthogonal matrix S such that \(S^TD_2S=D_1\). Hence

$$\begin{aligned} \mathcal{R}_\textrm{W}(\beta ^{(r)}_0)= & {} \left\{ SB^{-1/2}\frac{1}{\sqrt{n}}\sum _{i=1}^n\hat{\eta }_{i,\mathrm W}(\beta ^{(r)}_0)\right\} ^TD_2\\{} & {} \times \left\{ SB^{-1/2}\frac{1}{\sqrt{n}}\sum _{i=1}^n\hat{\eta }_{i,\mathrm W}(\beta ^{(r)}_0)\right\} +o_P(1). \end{aligned}$$

From Lemma 2 we obtain that

$$\begin{aligned} SB^{-1/2}\frac{1}{\sqrt{n}}\sum _{i=1}^n\hat{\eta }_{i,\mathrm W}(\beta ^{(r)}_0){\mathop {\longrightarrow }\limits ^{{D}}}N(\textbf{0},I_{p-1}). \end{aligned}$$

Above two results together proves (a) of Theorem 2. This completes the proof of Theorem 2. \(\square \)

Proof of Theorem 3

It is similar to the proof of Theorem 4 in Wang et al. (2010), and hence we omit its proof. \(\square \)

Proof of Theorem 4

Using the standard argument, we can obtain

$$\begin{aligned} n^{-1}S_{n,l}(u;\hat{\beta }_\textrm{L})=h_2^lp(u)\int \!\!t^lK(t)dt + O_P(h_2^{l+2}),~~l=0,1,2. \end{aligned}$$

Therefore, from (2.4) it follows

$$\begin{aligned} \hat{g}(u;\hat{\beta }_\textrm{L})-g(u)=\frac{1}{2}\mu _2h_2^2g''(u) + \frac{1}{n}\sum _{i=1}^n\frac{\delta _i\varepsilon _i}{\pi (V_i)p(u)}K_{h_2}(\beta ^TX_i-u) + o_P(c_n) \end{aligned}$$

when L = W, I and WI respectively, where \(\mu _2=\displaystyle \int \!\!t^2K(t)dt\), \(c_n=(nh_2)^{-1/2}+h_2^2\). By Theorem 4.4 of Masry and Tjøstheim (1995) as well as Slutsky’s theorem, Theorem 4 is proved. \(\square \)

Proof of Theorem 5

From (A.7). we can obtain

$$\begin{aligned} \tilde{R}_\textrm{L}(\beta ^{(r)}_0) = \{\sqrt{n}\widehat{Q}_\textrm{L}^T(\beta ^{(r)}_0)\}\hat{B}^{-1}(\beta ^{(r)}_0)\{\sqrt{n}\widehat{Q}_\textrm{L}(\beta ^{(r)}_0)\} +o_P(1) \end{aligned}$$

when L = W and I respectively. This, together with Slutsky’s theorem and Lemmas 2 and 3, proves Theorem 5. \(\square \)

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Xue, L., Xie, J. Efficient robust estimation for single-index mixed effects models with missing observations. Stat Papers 65, 827–864 (2024). https://doi.org/10.1007/s00362-023-01407-2

Download citation

  • Received:

  • Revised:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00362-023-01407-2

Keywords

Mathematics Subject Classification

Navigation