Annals of the Institute of Statistical Mathematics

, Volume 58, Issue 4, pp 687–706

Semiparametric Maximum Likelihood for Missing Covariates in Parametric Regression


    • Division of BiostatisticsU.S. Food and Drug Administration
  • Howard E. Rockette
    • Department of BiostatisticsUniversity of Pittsburgh

DOI: 10.1007/s10463-006-0047-7

Cite this article as:
Zhang, Z. & Rockette, H.E. AISM (2006) 58: 687. doi:10.1007/s10463-006-0047-7


We consider parameter estimation in parametric regression models with covariates missing at random. This problem admits a semiparametric maximum likelihood approach which requires no parametric specification of the selection mechanism or the covariate distribution. The semiparametric maximum likelihood estimator (MLE) has been found to be consistent. We show here, for some specific models, that the semiparametric MLE converges weakly to a zero-mean Gaussian process in a suitable space. The regression parameter estimate, in particular, achieves the semiparametric information bound, which can be consistently estimated by perturbing the profile log-likelihood. Furthermore, the profile likelihood ratio statistic is asymptotically chi-squared. The techniques used here extend to other models.


Asymptotic normalityEfficiencyInfinite-dimensional M-estimationMissing at randomMissing covariatesParametric regressionProfile likelihoodSemiparametric likelihood

Copyright information

© The Institute of Statistical Mathematics, Tokyo 2006