Abstract
Datasets involving repeated measurements over time are common in medical trials and epidemiological cohort studies. The outcomes and covariates are usually observed from randomly selected subjects, each at a set of possibly unequally spaced time design points. One useful approach for evaluating the effects of covariates is to consider linear models at a specific time, but the coefficients are smooth curves over time. We show that kernel estimators of the coefficients that are based on ordinary local least squares may be subject to large biases when the covariates are time-dependent. As a modification, we propose a two-step kernel method that first centers the covariates and then estimates the curves based on some local least squares criteria and the centered covariates. The practical superiority of the two-step kernel method over the ordinary least squares kernel method is shown through a fetal growth study and simulations. Theoretical properties of both the two-step and ordinary least squares kernel estimators are developed through their large sample mean squared risks.
Similar content being viewed by others
References
Altman, N. S. (1990). Kernel smoothing of data with correlated errors, J. Amer. Statist. Assoc., 85, 749–759.
Davidian, M. and Giltinana, D. M. (1995). Nonlinear Models for Repeated Measurement Data, Chapman Hall, London.
Diggle, P. J., Liang, K.-Y. and Zeger, S. L. (1994). Analysis of Longitudinal Data, Oxford University Press, Oxford, England.
Härdle, W. (1990). Applied Nonparametric Regression, Cambridge University Press, Cambridge, England.
Hart, J. D. and Wehrly, T. E. (1986). Kernel regression estimation using repeated measurements data, J. Amer. Statist. Assoc., 81, 1080–1088.
Hart, J. D. and Wehrly, T. E. (1993). Consistency of cross-validation when the data are curves, Stochastic Process. Appl., 45, 351–361.
Hastie, T. J. and Tibshirani, R. J. (1993). Varying-coefficient models, J. Roy. Statist. Soc. Ser. B, 55, 757–796.
Hoover, D. R., Rice, J. A., Wu, C. O. and Yang, L.-P. (1998). Nonparametric smoothing estimates of time-varying coefficient models with longitudinal data, Biometrika, 85, 809–822.
Moyeed, R. A. and Diggle, P. J. (1994). Rates of convergence in semiparametric modeling of longitudinal data, Austral. J. Statist., 36, 75–93.
Nadaraya, E. A. (1964). On estimating regression, Theory Probab. Appl., 10, 186–190.
Rice, J. A. and Silverman, B. W. (1991). Estimating the mean and covariance structure nonparametrically when the data are curves, J. Roy. Statist. Soc. Ser. B, 53, 233–243.
Rosenblatt, M. (1969). Conditional probability density and regression estimates, Multivariate Analysis II (ed. Krishnaiah), 25–31, Academic Press, New York.
Vonesh, E. F. and Chinchilli, V. M. (1997). Linear and Nonlinear Models for the Analysis of Repeated Measurements, Marcel Dekker, New York.
Watson, G. S. (1964). Smooth regression analysis, Sankhyā Series A, 26, 359–372.
Zeger, S. L. and Diggle, P. J. (1994). Semiparametric models for longitudinal data with application to CD4 cell numbers in HIV seroconverters, Biometrics, 50, 689–699.
Author information
Authors and Affiliations
About this article
Cite this article
Wu, C.O., Yu, K.F. & Chiang, CT. A Two-Step Smoothing Method for Varying-Coefficient Models with Repeated Measurements. Annals of the Institute of Statistical Mathematics 52, 519–543 (2000). https://doi.org/10.1023/A:1004125621021
Issue Date:
DOI: https://doi.org/10.1023/A:1004125621021