Abstract
Methods are proposed to choose classifiers from a given collection of classifiers when there are missing covariates in the data. Two situations are considered: (i) the case where the new observation (to be classified) has no missing covariates and (ii) the case where the new observation is also allowed to have missing covariates. Using arguments from the empirical process theory, exponential performance bounds will be derived for the resulting classifiers. Such bounds, together with the Borel-Cantelli lemma, yield various strong consistency results.
Similar content being viewed by others
References
Bennett G (1962) Probability inequalities for the sum of independent random variables. J Am Stat Assoc 57: 33–45
Chenouri S, Mojirsheibani M, Montazeri Z (2009) Empirical Measures for incomplete data with applications. Electron J Stat 3: 1021–1038
Devroye L, Györfi L, Lugosi G (1996) A probabilistic theory of pattern recognition. Springer, New York
Dudley R (1978) Central limit theorems for empirical measures. Ann Prob 6: 899–929
Györfi L, Kohler M, Krzyzak A, Walk H (2002) A distribution-free theory of nonparametric regression. Springer, New York
Little RJA, Rubin DB (2002) Statistical analysis with missing data. Wiley, New York
Mojirsheibani M, Montazeri Z (2007) Statistical classification with missing covariates. J R Stat Soc Ser B 69: 839–857
Pollard D (1984) Convergence of stochastic processes. Springer, New York
Van der Vaart AW, Wellner JA (1996) Weak convergence and empirical processes with application to statistics. Springer, New York
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Mojirsheibani, M. Some results on classifier selection with missing covariates. Metrika 75, 521–539 (2012). https://doi.org/10.1007/s00184-010-0340-6
Received:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00184-010-0340-6