Abstract
This paper develops a weighted composite quantile regression method for linear models where some covariates are missing not at random but the missingness is conditionally independent of the response variable. It is known that complete case analysis (CCA) is valid under these missingness assumptions. By fully utilizing the information from incomplete data, empirical likelihood-based weights are obtained to conduct the weighted composite quantile regression. Theoretical results show that the proposed estimator is more efficient than the CCA one if the probability of missingness on the fully observed variables is correctly specified. Besides, the proposed algorithm is computationally simple and easy to implement. The methodology is illustrated on simulated data and a real data set.
Similar content being viewed by others
References
Bartlett JW, Carpente JR, Tilling K, Vansteelandt S (2014) Improving upon the efficiency of complete case analysis when covariates are MNAR. Biostatistics 15(4):719–730
Bloznelis D, Claeskens G, Zhou J (2019) Composite versus model-averaged quantile regression. J Stat Plan Inference 200:32–46
Bradic J, Fan JQ, Wang WW (2011) Penalized composite quasi-likelihood for ultrahigh dimensional variable selection. J R Stat Soc Ser B 73(3):325–349
Jiang XJ, Jiang JC, Song XY (2012) Oracle model selection for nonlinear models based on weighted composite quantile regression. Stat Sin 22(4):1479–1506
Koenker R (2005) Quantile regression. Cambridge University Press, New York
Little RJ, Rubin DB (2002) Statistical analysis with missing data, 2nd edn. Wiley, Hoboken
Little RJ, Zhang N (2011) Subsample ignorable likelihood for regression analysis with missing data. J R Stat Soc Ser C 60(4):591–605
Liu TQ, Yuan XH (2016) Weighted quantile regression with missing covariates using empirical likelihood. Statistics 50(1):89–113
Molanes Lopez EM, Van Keilegom I, Veraverbeke N (2009) Empirical likelihood for non-smooth criterion functions. Scand J Stat 36(3):413–432
Ning ZJ, Tang LJ (2014) Estimation and test procedures for composite quantile regression with covariates missing at random. Stat Probab Lett 95:15–25
Owen AB (1990) Empirical likelihood ratio confidence regions. Ann Stat 18(1):90–120
Pollard D (1991) Asymptotics for least absolute deviation regression estimators. Econ Theory 7(2):186–199
Qin J, Lawless J (1994) Empirical likelihood and general estimating equations. Ann Stat 22(1):300–325
Sherwood B (2016) Variable selection for additive partial linear quantile regression with missing covariates. J Multivar Anal 152:206–223
Sun J, Ma YY (2017) Empirical likelihood weighted composite quantile regression with partially missing covariates. J Nonparametr Stat 29(1):137–150
Sun J, Sun QH (2015) An improved and efficient estimation method for varying-coefficient model with missing covariates. Stat Probab Lett 107:296–303
Sun J, Gai YJ, Lin L (2013) Weighted local linear composite quantile estimation for the case of general error distributions. J Stat Plan Inference 143:1049–1063
Tang NS, Wang XJ (2019) Robust estimation of generalized estimating equations with finite mixture correlation matrices and missing covariates at random for longitudinal data. J Multivar Anal 173:640–655
Tang LJ, Zhou ZG (2015) Weighted local linear CQR for varying-coefficient models with missing covariates. TEST 24(3):583–604
Tang LJ, Zheng SC, Zhou ZG (2018) Estimation and inference of combining quantile and least-square regressions with missing data. J Korean Stat Soc 47:77–89
Yoshida T (2017) Two stage smoothing in additive models with missing covariates. Stat Pap 60(6):1803–1826
Zhao ZB, Xiao ZJ (2014) Efficient regressions via optimally combining quantile information. Econ Theory 30(6):1272–1314
Zou H, Yuan M (2008) Composite quantile regression and the oracle model selection theory. Ann Stat 36(3):1108–1126
Acknowledgements
This research was supported by the Natural Science Foundation of Shandong Province, China (ZR2017QA011). The author would like to thank the anonymous reviewers for their valuable comments and suggestions.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Appendix
Appendix
The following regularity conditions are required for the asymptotic analysis. Conditions (C1, C2, C6–C8) are similar to those in Sun and Ma (2017). Condition (C3) is a necessary assumption about the missing data mechanism in this paper. Condition (C4–C5) guarantee that the asymptotic covariance matrices of the CCA-based and ACC-based CQR estimators are both positive definite.
-
(C1)
\(\varvec{w}\) has a bounded support.
-
(C2)
The density function \(f(\cdot )\) of \(\varepsilon \) is bounded away from zero and has a continuous and uniformly bounded derivative.
-
(C3)
\(\delta \) is independent of y given \(\varvec{w}\).
-
(C4)
\(\varvec{D}\) and \(\varvec{S_\phi }\) are positive definite.
-
(C5)
\({\varvec{S}}_{\varvec{B}}\), \(\varvec{P}_2\) and \(\varvec{S_\phi }-\varvec{P}_1\varvec{P}_2^{-1}\varvec{P}_1^T\) are positive definite.
-
(C6)
For all \((y_i,\varvec{z}_i)\), \(\pi (y_i,\varvec{z}_i,\varvec{\gamma })\) admits all third partial derivatives \(\frac{\partial ^3\pi (y_i,\varvec{z}_i,\varvec{\gamma })}{\partial \gamma _k\partial \gamma _l\partial \gamma _m}\) for all \(\varvec{\gamma }\) in a neighborhood of the true value \(\varvec{\gamma }^*\), \(\Vert \frac{\partial ^3\pi (y_i,\varvec{z}_i,\varvec{\gamma })}{\partial \gamma _k\partial \gamma _l\partial \gamma _m}\Vert \) is bounded by an integrable function for all \(\varvec{\gamma }\) in this neighborhood, and \(\Vert \frac{\partial \pi (y_i,\varvec{z}_i,\varvec{\gamma })}{\partial \varvec{\gamma }^T}\Vert \) is bounded by an integrable function for all \(\varvec{\gamma }\) in this neighborhood.
-
(C7)
\(\pi (y,\varvec{z},\varvec{\gamma }^*)\) is bounded away from zero, i.e. \(\underset{y,z}{\inf }\,\pi (y,\varvec{z},\varvec{\gamma }^*)\ge c_0\) for some \(c_0 > 0\).
-
(C8)
\(\Vert \varvec{\xi }(y,\varvec{z},\varvec{\zeta })\Vert ^2\) is bounded by an integrable function for all \(\varvec{\zeta }\) in a neighbourhood of \(\varvec{\zeta }^*\) and \(\varvec{\xi }(y,\varvec{z},\varvec{\zeta })\) is continuous at each \(\varvec{\zeta }\) with probability one in this neighbourhood, where \(\varvec{\zeta }=(\varvec{\alpha }^T,\varvec{\beta }^T,\varvec{b}^T)^T\) and \(\varvec{\zeta }^*=({\varvec{\alpha }^*}^T,{\varvec{\beta }^*}^T,{\varvec{b}^*}^T)^T\). For some \(c > 0\),
For convenience of representation, for \(i=1,\ldots ,n\) and \(k=1,\ldots ,q\), write
Lemma 1
-
(i)
\(det(\varvec{S}_{\varvec{g}})= det(\varvec{P}_2)det(\varvec{S}_B)\), where \(det(\varvec{A})\) denotes the determinant of some matrix \(\varvec{A}\).
-
(ii)
\(\varvec{G_\gamma }=-E(\varvec{g}_i\varvec{U}_{B_i}^T)\), \(\varvec{F_\gamma } =-E(\varvec{\phi }_i\varvec{U}_{B_i}^T)\). Thus \(\varvec{F_\gamma }=\varvec{F_g}\varvec{S_g}^{-1}\varvec{G_\gamma }\).
-
(iii)
\(\varvec{F}_{\varvec{g}}\varvec{S}_{\varvec{g}}^{-1}\varvec{F}_{\varvec{g}}^T=\varvec{P}_1\varvec{P}_2^{-1}\varvec{P}_1^T+E(\varvec{\phi }_i\varvec{U}_{B_i}^T)\varvec{S}_{B}^{-1}E(\varvec{U}_{B_i}\varvec{\phi }_i^T)\), \(\varvec{F}_{\varvec{g}}\varvec{S}_{\varvec{g}}^{-1}\varvec{G}_{\varvec{\gamma }}=-E(\varvec{\phi }_i\varvec{U}_{B_i}^T)\), \(\varvec{S}_{\varvec{g}}^{-1}\varvec{G}_{\varvec{\gamma }}= \begin{pmatrix}\varvec{0}^T, -\varvec{I}^T \end{pmatrix}^T.\)
Proof of Lemma 1
The results can be derived by some calculations and the details are omitted. \(\square \)
Proof of Theorem 1
The proof of Theorem 1 is similar to and much easier than that of Theorem 2. The details are omitted. \(\square \)
Proof of Theorem 2
For fixed estimators \(\hat{\varvec{\kappa }}\), the Lagrange multiplier \(\hat{\varvec{\lambda }}\) satisfies the constraints equations \({\varvec{\Lambda }}(\hat{\varvec{\lambda }},\hat{\varvec{\kappa }})=0.\) According to Lemma A.1 in Sun and Ma (2017),
By Taylor expansion,
where \(\tilde{\varvec{\kappa }}=(\tilde{\varvec{\alpha }}^T,\tilde{\varvec{\beta }}^T,\tilde{\varvec{b}}^T,\tilde{\varvec{\gamma }}^T)^T\in \{\varvec{\kappa }:\Vert \varvec{\kappa }-\varvec{\kappa }^*\Vert \le cn^{-1/2}\}\) with a positive constant c. According to the law of large numbers,
By combining these with (4.1) and
it holds that
Let \(\varvec{u}=\sqrt{n}(\varvec{\beta }-\varvec{\beta }^*)\), \(\hat{\varvec{u}}=\sqrt{n}(\hat{\varvec{\beta }}_{acc}-\varvec{\beta }^*)\), \(v_k=\sqrt{n}(b_k-b_k^*)\) and \(\hat{v}_k=\sqrt{n}(\hat{b}_{k,acc}-b_k^*)\), \(k=1,\ldots ,q\). Let \(\varvec{\theta }=(\varvec{u}^T,v_1,\ldots ,v_q)^T\) and \(\hat{\varvec{\theta }}=(\hat{\varvec{u}}^T,\hat{v}_1,\ldots ,\hat{v}_q)^T\), then \(\hat{\varvec{\theta }}\) is the minimizer of
where
Owing to (2.4),
Then by substituting (4.2) and (4.4) into \(\varvec{\Pi }_n\), it holds that
Based on some calculations and Lemma 1 (ii),
Substituting these into (4.5) leads to
By some calculations, it holds that
and \(\varvec{\Pi }_n{\mathop {\rightarrow }\limits ^{d}}\varvec{\Pi }_0\), where \(\varvec{\Pi }_0\) is a random vector following \(N(\varvec{0},\varvec{S_\phi }-\varvec{P}_1\varvec{P}_2^{-1}\varvec{P}_1^T)\).
Similarly, for \(k=1,\ldots ,q\),
and \(z_{n,k}{\mathop {\rightarrow }\limits ^{d}}z_{k}\), where \(z_{k}\) follows some one-dimensional normal distribution.
For \(i=1,\ldots ,n\) and \(k=1,\ldots ,q\), write \(A_{i,k}=\int _{0}^{(v_k+\varvec{w}_i^T\varvec{u})/\sqrt{n}}\big (I(\varepsilon _i\le b_k^*+s)-I(\varepsilon _i\le b_k^*)\big )\hbox {d}s\). Then
Based on some derivations,
Therefore
By substituting the expression of \(B_{n,k}\) into (4.3),
According to the convexity lemma in Pollard (1991), it follows that
Finally, combining this with \(\varvec{\Pi }_n{\mathop {\rightarrow }\limits ^{d}}\varvec{\Pi }_0\) leads to
\(\square \)
Proof of Theorem 3
Write \(\varvec{M}_i=\varvec{M}(\varvec{t}_i,\varvec{\beta }^*,\varvec{\gamma }^*)\). According to the proof of Lemma 6 in Molanes Lopez et al. (2009), it holds that
where
Define \(\varvec{B}_{11.2}=\varvec{B}_{11}-\varvec{B}_{12}\varvec{B}_{22}^{-1}\varvec{B}_{12}^T=\varvec{S}_{\varvec{\phi }}-\varvec{F}_{\varvec{g}}\varvec{S}_{\varvec{g}}^{-1}\varvec{F}_{\varvec{g}}^T\). According to Lemma 1 (iii),
Then it is easy to derive that
where \(\varvec{H}_{22.1}=\varvec{H}_{22}-\varvec{H}_{12}^T\varvec{H}_{11}^{-1}\varvec{H}_{12}=\varvec{S}_B\). Note that
Therefore
\(\square \)
Rights and permissions
About this article
Cite this article
Sun, J. An improvement on the efficiency of complete-case-analysis with nonignorable missing covariate data. Comput Stat 35, 1621–1636 (2020). https://doi.org/10.1007/s00180-020-00964-6
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00180-020-00964-6