Abstract
We study robust estimation of generalized partially linear models (GPLMs) for longitudinal data with dropouts, with the goal of achieving robustness against outliers. First, a weighted likelihood method is proposed to robustly estimate the parameters of the dropout model that describes the missing-data process. Then, a robust inverse probability-weighted generalized estimating equation is developed to robustly estimate the mean model. To approximate the nonparametric function in the GPLM, we adopt a regression spline smoothing method, which linearizes the nonparametric function so that statistical inference can be conducted as if a generalized linear model were used. The asymptotic properties of the proposed estimator are established under some regularity conditions, and simulation studies demonstrate its robustness. Finally, the proposed method is applied to a real data set.
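The remark that regression splines linearize the nonparametric function can be illustrated with a minimal sketch. This is not the authors' estimator: it assumes an identity link, complete and noise-free toy data, ordinary least squares in place of the robust inverse probability-weighted estimating equation, and a cubic truncated power basis with hand-picked knots. The function name `truncated_power_basis` and all variable names are hypothetical.

```python
import numpy as np

def truncated_power_basis(t, knots, degree=3):
    """Columns: 1, t, ..., t^degree, then (t - k)_+^degree for each knot k."""
    t = np.asarray(t, dtype=float)
    poly = [t ** p for p in range(degree + 1)]
    trunc = [np.clip(t - k, 0.0, None) ** degree for k in knots]
    return np.column_stack(poly + trunc)

rng = np.random.default_rng(0)
n = 200
x = rng.normal(size=n)         # covariate with a linear (parametric) effect
t = rng.uniform(0.0, 1.0, n)   # covariate entering through the unknown g(t)
beta_true = 1.5
y = beta_true * x + t ** 2     # toy response: identity link, no noise, g(t) = t^2

# Replace g(t) by a spline expansion B(t) @ alpha, so the partially linear
# model becomes an ordinary linear model in the stacked coefficients.
knots = [0.25, 0.5, 0.75]      # assumed interior knots
B = truncated_power_basis(t, knots)
design = np.column_stack([x, B])

coef, *_ = np.linalg.lstsq(design, y, rcond=None)
beta_hat = coef[0]             # estimate of the parametric coefficient
g_hat = B @ coef[1:]           # fitted nonparametric component at observed t
```

Once the basis columns are appended to the design matrix, the spline coefficients are estimated jointly with the parametric coefficient by the same fitting routine, which is the sense in which inference proceeds as for a (generalized) linear model. In the paper's setting the least-squares step would be replaced by the robust weighted estimating equation.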
Acknowledgments
The authors are grateful to the Editor and two referees for their constructive suggestions, which greatly improved the presentation of the paper. This work was partially supported by the National Natural Science Foundation of China (11371100, 11271080) and the Shanghai Leading Academic Discipline Project (Project Number B118).
Cite this article
Qin, G., Zhu, Z. & Fung, W.K. Robust estimation of generalized partially linear model for longitudinal data with dropouts. Ann Inst Stat Math 68, 977–1000 (2016). https://doi.org/10.1007/s10463-015-0519-8