Abstract
Clustered data arise commonly in practice, and it is often of interest to estimate both the mean response parameters and the association parameters. However, most research has focused on inference about the mean response parameters, with the association parameters relegated to a nuisance role. Little work addresses the marginal and association structures jointly, especially in the semiparametric framework. In this paper, our interest centers on inference about the association parameters in addition to the mean parameters. We develop semiparametric methods for both complete and incomplete clustered binary data and establish the associated theoretical results. The proposed methodology is illustrated through numerical studies.
About this article
Cite this article
Yi, G.Y., He, W. & Liang, H. Semiparametric marginal and association regression methods for clustered binary data. Ann Inst Stat Math 63, 511–533 (2011). https://doi.org/10.1007/s10463-009-0239-z
DOI: https://doi.org/10.1007/s10463-009-0239-z