Robust estimation of generalized partially linear model for longitudinal data with dropouts

  • Guoyou Qin
  • Zhongyi ZhuEmail author
  • Wing K. Fung


In this paper, we study the robust estimation of generalized partially linear models (GPLMs) for longitudinal data with dropouts. We aim at achieving robustness against outliers. To this end, a weighted likelihood method is first proposed to obtain the robust estimation of the parameters involved in the dropout model for describing the missing process. Then, a robust inverse probability-weighted generalized estimating equation is developed to achieve robust estimation of the mean model. To approximate the nonparametric function in the GPLM, a regression spline smoothing method is adopted which can linearize the nonparametric function such that statistical inference can be conducted operationally as if a generalized linear model was used. The asymptotic properties of the proposed estimator are established under some regularity conditions, and simulation studies show the robustness of the proposed estimator. In the end, the proposed method is applied to analyze a real data set.


Dropouts Partially linear models Regression splines Robustness 



The authors are grateful to the Editor and two referees for their constructive suggestions that largely improve the presentation of the paper. This work was partially supported by the National Natural Science Foundation of China (11371100, 11271080) and Shanghai Leading Academic Discipline Project, Project Number: B118.


  1. Cantoni, E., Ronchetti, E. (2001). Robust inference for generalized linear models. Journal of the American Statistical Association, 96, 1022–1030.Google Scholar
  2. Chen, B., Zhou, X.H. (2013). Generalized partially linear models for incomplete longitudinal data in the presence of population-level information. Biometrics, 69, 386–395.Google Scholar
  3. Diggle, P. J., Heagerty, P., Liang, K. Y., Zeger, S. L. (2002). Analysis of longitudinal data (2nd ed.). Oxford: Oxford University Press.Google Scholar
  4. Drake, R. E., McHugo, G. J., Clark, R. E., Teague, G. B., Xie, H., Miles, K., et al. (1998). Assertive community treatment for patients with co-occurring severe mental illness and substance use disorder: a clinical trial. American Journal of Orthopsychiatry, 68, 201–215.Google Scholar
  5. He, X., Zhu, Z.Y., Fung, W.K. (2002). Estimation in a semiparametric model for longitudinal data with unspecified dependence structure. Biometrika, 89, 579–590.Google Scholar
  6. He, X., Fung, W.K., Zhu, Z.Y. (2005). Robust estimation in generalized partial linear models for clustered data. Journal of the American Statistical Association, 100, 1176–1184.Google Scholar
  7. Huber, P. J. (1981). Robust statistics. New York: Wiley.CrossRefzbMATHGoogle Scholar
  8. Lian, H., Liang, H., Wang, L. (2014). Generalized additive partial linear models for clustered data with diverging number of covariates using GEE. Statistica Sinica, 24, 173–196.Google Scholar
  9. Little, R. J. A., Rubin, D. B. (2002). Statistical analysis with missing data (2nd ed.). NewYork: Wiley.Google Scholar
  10. Preisser, J. S., Lohman, K. K., Rathouz, P. J. (2002). Performance of weighted estimating equations for longitudinal binary data with drop-outs missing at random. Statistics in Medicine, 21, 3035–3054.Google Scholar
  11. Qin, G.Y., Zhu, Z.Y. (2007). Robust estimation in generalized semiparametric mixed models for longitudinal data. Journal of Multivariate Analysis, 98, 1658–1683.Google Scholar
  12. Qin, G.Y., Zhu, Z.Y. (2009). Robustified maximum likelihood estimation in generalized partial linear mixed model for longitudinal data. Biometrics, 65, 52–59.Google Scholar
  13. Qin, G.Y., Zhu, Z.Y., Fung, W.K. (2008). Robust estimating equations and bias correction of correlation parameters for longitudinal data. Computational Statistics and Data Analysis, 52, 4745–4753.Google Scholar
  14. Rice, J. (1986). Convergence rates for partially splined models. Statistics and Probability Letters, 4, 203–208.MathSciNetCrossRefzbMATHGoogle Scholar
  15. Robins, J. M., Rontnitzky, A., Zhao, L. P. (1995). Analysis of semiparametric regression models for repeated outcomes in the presence of missing data. Journal of the American Statistical Association, 90, 106–121.Google Scholar
  16. Rousseeuw, P. J., van Zomeren, B. C. (1990). Unmasking multivariate outliers and leverage points. Journal of the American Statistical Association, 85, 633–639.Google Scholar
  17. Schumaker, L. L. (1981). Spline functions. New York: Wiley.zbMATHGoogle Scholar
  18. Sinha, S.K. (2004). Robust analysis of generalized linear mixed models. Journal of the American Statistical Association, 99, 451–460.Google Scholar
  19. Sinha, S.K. (2012). Robust analysis of longitudinal data with nonignorable missing responses. Metrika, 75, 913–938.Google Scholar
  20. Wang, Y. G., Lin, X., Zhu, M., Bai, Z. (2007). Robust estimation using the Huber function with a data-Dependent tuning constant. Journal of Computational and Graphical Statistics, 16, 468–481.Google Scholar
  21. Wang, J., Xie, H., Fisher, J. H. (2011). Multilevel models: applications using SAS. Beijing: China Higher Education Press.Google Scholar
  22. Yi, G.Y., He, W. (2009). Median regression models for longitudinal data with dropouts. Biometrics, 65, 618–625.Google Scholar

Copyright information

© The Institute of Statistical Mathematics, Tokyo 2015

Authors and Affiliations

  1. 1.Department of Biostatistics, School of Public Health and Key Laboratory of Public Health SafetyFudan UniversityShanghaiChina
  2. 2.Department of StatisticsFudan UniversityShanghaiChina
  3. 3.Department of Statistics and Actuarial ScienceThe University of Hong KongHong KongChina

Personalised recommendations