Abstract
In this paper, we study the efficient robust estimation and empirical likelihood for a single-index mixed effects model with a subset of covariates and response missing at random. Three efficient robust estimators and empirical likelihood ratios for index coefficients are constructed using weighted, imputed and weighted-imputed method, their asymptotic properties are proved. Our results show that the three estimators are asymptotically equivalent, and a weighted-imputed empirical log-likelihood ratio is asymptotically chi-squared. An important feature of our methods is their ability to handle missing response and/or partially missing covariates. Some simulation studies and a real data example indicate that our methods have fine performance in finite sample, and are available in practice.
Similar content being viewed by others
References
Arnold SF (1981) The theory of linear models and multivariate analysis. Wiley, New York
Carroll RJ, Fan J, Gijbels I, Wand MP (1997) Generalized partially linear single-index models. J Am. Stat. Assoc. 92:477–489
Chen XH, Hong H, Tarozzi A (2008) Semiparametric efficiency in GMM models with auxiliary data. Ann Stat 36:808–843
Craven P, Wahba G (1979) Smoothing and noisy data with spline functions: estimating the correct degree of smoothing by the method of generalized cross-validation. Numer Math 31:377–403
Fan JQ, Gijbels I (1996) Local polynomial modeling and its applications. Chapman and Hall, London
Guo X, Niu CZ, Yang YP, Xu WL (2015) Empirical likelihood for single index model with missing covariates at random. Statistics 49:588–601
Liang H, Wang SJ, Robins JM, Carroll RJ (2004) Estimation in partially linear models with missing covariates. J Am Stat Assoc 99:357–367
Liang KY, Zeger SL (1986) Longitudinal data analysis using generalized linear model. Biometrika 73:13–22
Lin XH, Carroll RJ (2001) Semiparametric regression for clustered data using generalized estimating equations. J Am Stat Assoc 96:1045–1056
Masry E, Tjøstheim D (1995) Nonparametric estimation and identification of nonlinear ARCH time series: Strong convergence and asymptotic normality. Econometric Theory 11:258–289
Owen AB (1990) Empirical likelihood ratio confidence regions. Ann Stat 18:90–120
Pang Z, Xue LG (2012) Estimation for single-index models with random effects. Comput Stat Data Anal 56:1837–1853
Qin J, Lawless J (1994) Empirical likelihood and general estimating equations (in likelihood and related topics). Ann Stat 22:300–325
Qin J, Zhang B, Leung DHY (2009) Empirical likelihood in missing data problems. J Am Stat Assoc 104:1492–1503
Robins JM, Rotnitzky A, Zhao LP (1994) Estimation of regression-coefficients when some regressors are not always observed. J Am Stat Assoc 89:846–866
Ruckstuhl AF, Welsh AH, Carroll RJ (2000) Nonparametric function estimation of the relationship between two repeatedly measured variables. Stat Sin 10:51–71
Stute W, Zhu LX (2005) Nonparametric checks for single-index models. Ann Stat 33:1048–1083
Wang JL, Xue LG, Zhu LX, Chong YS (2010) Estimation for a partial-linear single-index model. Ann Stat 38:246–274
Wei Y, Ma Y, Carroll R (2012) Multiple imputation in quantile regression. Biometrika 99:423–438
Welsh AH (1989) On M-processes and M-estimation. Ann Stat. 17 337–361. [Correction(1990) 18 1500.]
Wu H, Wu L (2001) A multiple imputation method for missing covariates in nonlinear mixed-effect models, with application to HIV dynamics. Stat Med 20:1755–1769
Wu L (2002) A joint model for nonlinear mixed-effects models with censoring and covariates measured with error, with application to AIDS studies. J Am Stat Assoc 97:955–964
Xue LG (2013) Estimation and empirical likelihood for single-index models with missing data in the covariates. Comput Stat Data Anal 60:82–97
Xue LG, Lian H (2016) Empirical likelihood for single-index models with responses missing at random. Sci China Math 59:1187–1207
Xue LG, Zhang JH (2020) Empirical likelihood for single-index models with missing observations. Comput Stat Data Anal 44, 106877
Acknowledgements
The authors are very grateful to the chief editor and the associate editor for their strong support and help! Thanks to the two anonymous reviewers for their careful review! Many useful opinions and suggestions have greatly improved the quality of this paper. This work was supported by the National Natural Science Foundation of China (11971001). The data set of real example is from an AIDS clinical trial group (ACTG) study.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Appendix. Proofs of Theorems
Appendix. Proofs of Theorems
To facilitate analysis, we introduce several lemmas, the proofs of which are given in the supplementary material.
Lemma 1
Suppose that conditions (C1)–(C8) hold. Then
when L = W, I and WI respectively, uniformly for \(\beta ^{(r)}\in \mathcal{B}_n^*\) with \(\mathcal{B}_n^*=\{\beta ^{(r)}|\,\Vert \beta ^{(r)}-\beta _0^{(r)}\Vert \le cn^{-1/2}\}\) for a constant \(c>0\), where \(\widehat{Q}_\textrm{W}(\beta ^{(r)})\), \(\widehat{Q}_\textrm{I}(\beta ^{(r)})\), \(\widehat{Q}_\textrm{WI}(\beta ^{(r)})\) and A are defined in (2.6) and (2.12) and (3.2) respectively,
\(\xi _i(\beta ^{(r)})\) is defined in (3.4) and \(m(V_i;\beta ^{(r)})=E\{\xi _i(\beta ^{(r)})|V_i\}\).
Lemma 2
Suppose that conditions (C1)–(C8) hold. Then
when L = W, I and WI respectively, where \(\widehat{Q}_\textrm{W}(\beta _0^{(r)})\), \(\widehat{Q}_\textrm{I}(\beta _0^{(r)})\) and \(\widehat{Q}_\textrm{WI}(\beta _0^{(r)})\) are defined in (2.6) and (2.12) respectively, and B is defined in (3.3).
Lemma 3
Suppose that conditions (C1)–(C8) hold. Then
when L = W and I respectively, where B, \(C_\textrm{W}\) and \(C_\textrm{I}\) are defined in (3.3), (3.5) and (3.6) respectively, and \(\hat{\eta }_{i,\mathrm W}(\cdot )\), \(\hat{\eta }_{i,\mathrm I}(\cdot )\) and \(\hat{\eta }_{i,\mathrm WI}(\cdot )\) are defined in (2.9), (2.13) and (2.14) respectively.
Lemma 4
Suppose that conditions (C1)–(C8) hold. Then
when L = W, I and WI respectively.
Proof of Theorem 1
We consider the asymptotic normality of \(\hat{\beta }_\textrm{L}\) when L= W, I and WI respectively. The proof is divided into two steps: Step (I) provides the existence of the estimator \(\hat{\beta }_\textrm{L}^{(r)}\) of \(\beta _0^{(r)}\), and step (II) proves the asymptotic normality of \(\hat{\beta }_\textrm{L}^{(r)}\). \(\square \)
(I) Proof of existence. We prove the following fact: Under conditions (C1)–(C8) and with probability one there exists an estimator of \(\beta _0^{(r)}\) solving the estimating Eqs. (2.6) or (2.12) in \(\mathcal{B}_n^{**}\), where \(\mathcal{B}_n^{**}=\big \{\beta ^{(r)}|\, \Vert \beta ^{(r)}-\beta _0^{(r)}\Vert =Mn^{-1/2}\big \}\) for some constant M such that \(0<M<\infty \). In fact, by Lemma 1, we have
when L = W, I and WI respectively, uniformly for \(\beta ^{(r)}\in \mathcal{B}_n^{**}\), where \(Q_\textrm{WI}(\beta ^{(r)})\) is defined in Lemma 1. Therefore, we have
We note that term (A.2) is dominated by the term \(\sim M^2\) because \(\sqrt{n}\Vert \beta ^{(r)}-\beta _0^{(r)}\Vert =M\), whereas \(|n(\beta ^{(r)}-\beta _0^{(r)})^TQ_\textrm{WI}(\beta _0^{(r)})|=MO_P(1)\), and \(n(\beta ^{(r)}-\beta _0^{(r)})A(\beta ^{(r)}-\beta _0^{(r)})\sim M^2\). So, for any given \(\eta >0\), if M is chosen large enough, then it will follows that \(n(\beta ^{(r)}-\beta _0^{(r)})\widehat{Q}_\textrm{L}(\beta ^{(r)})<0\) on an event with probability \(1-\eta \). From the arbitrariness of \(\eta \), we can prove the existence of the estimator of \(\beta _0^{(r)}\) in \(\mathcal{B}_n^{**}\) as in the proof of Theorem 5.1 of Welsh (1989). The details are omitted.
(II) Proof of asymptotic normality. From step (I) we find that \(\hat{\beta }_\textrm{L}^{(r)}\) is a solution in \(\mathcal{B}_n^{**}\) to the equation \(\widehat{Q}_\textrm{L}(\beta ^{(r)})=0\). That is, \(\widehat{Q}_\textrm{L}(\hat{\beta }_\textrm{L}^{(r)})=0\). From (A.1), we have
and hence
when L = W, I and WI respectively. Theorem 2 follows from this, Lemma 2, Central Limit Theorem and Slutsky’s theorem. \(\square \)
Proof of Corollary 1
It is easy to prove
when L = W, I and WI respectively. Therefore, Corollary 1 can be derived from Theorem 1. \(\square \)
Proof of Corollary 2
Note that
when L = W, I and WI respectively. Therefore, from Theorem 1 we obtain
when L = W, I and WI respectively. This, together with the above formula, proves Corollary 2. \(\square \)
Proof of Theorem 2
By the Lagrange multiplier method, \(\mathcal{R}_\textrm{L}(\beta ^{(r)}_0)\) can be represented as
when L = W, I and WI respectively, where \(\lambda =\lambda (\beta ^{(r)}_0)\) is a \((p-1)\times 1\) vector given as the solution to
By Lemmas 2 and 3, and using the same arguments as are used in the proof of (2.15) in Owen (1990), we can show that
Applying the Taylor expansion to (A.3), and invoking Lemmas 3–4 and (A.5), we get that
when L = W, I and WI respectively. By (A.4), it follows that
This, together with Lemmas 3 and 4 as well as (A.5), proves that
and
Therefore, from (A.6) we have
when L = W, I and WI respectively. This, together with Slutsky’s theorem and Lemmas 2 and 3, proves (c) of Theorem 2.
Now, we prove (a) of Theorem 2 only. Similarly, (b) of Theorem 2 can also be proved. From (A.7) and (a) of Lemma 3 we obtain
where \(D_1=B^{1/2}C_\textrm{W}^{-1}B^{1/2}\). Let \(D_2=\text{ diag }(w_1^*,\ldots ,w_{p-1}^*)\), where \(w_i^*,\ldots ,w_{p-1}^*\) are the eigenvalues of \(C_\textrm{W}^{-1}B\). Note that \(D_1\) and \(C_\textrm{W}^{-1}B\) have the same eigenvalues. Then there exists an orthogonal matrix S such that \(S^TD_2S=D_1\). Hence
From Lemma 2 we obtain that
Above two results together proves (a) of Theorem 2. This completes the proof of Theorem 2. \(\square \)
Proof of Theorem 3
It is similar to the proof of Theorem 4 in Wang et al. (2010), and hence we omit its proof. \(\square \)
Proof of Theorem 4
Using the standard argument, we can obtain
Therefore, from (2.4) it follows
when L = W, I and WI respectively, where \(\mu _2=\displaystyle \int \!\!t^2K(t)dt\), \(c_n=(nh_2)^{-1/2}+h_2^2\). By Theorem 4.4 of Masry and Tjøstheim (1995) as well as Slutsky’s theorem, Theorem 4 is proved. \(\square \)
Proof of Theorem 5
From (A.7). we can obtain
when L = W and I respectively. This, together with Slutsky’s theorem and Lemmas 2 and 3, proves Theorem 5. \(\square \)
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Xue, L., Xie, J. Efficient robust estimation for single-index mixed effects models with missing observations. Stat Papers 65, 827–864 (2024). https://doi.org/10.1007/s00362-023-01407-2
Received:
Revised:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00362-023-01407-2
Keywords
- Single-index mixed effects model
- Missing observation
- Weighted-imputed method
- Bias-correction technique
- Efficient robust estimator