Abstract
We consider classifying an object based on mixed continuous and discrete variables between two populations. Mixed discrete and continuous covariates with identical means in both populations are amongst the variables. Under the location model with homogeneous location specific conditional dispersion matrices for both populations, the Bayes rule is given. Classification is implemented by a plug-in version of the Bayes rule with full covariate adjustment. An asymptotic expansion of the overall expected error of the procedure is derived. Our findings generalize several classical results.
Similar content being viewed by others
References
Anderson, T.W. (1973), An asymptotic expansion of the distribution of the studentized classification statistics W, Ann. Statist. 1, 962–972.
Anderson, T. W. (1984), An introduction to multivariate statistical analysis (Wiley, New York, 2nd ed.).
Balakrishnan, N., Kocherlakota, S. and Kocherlakota, K. (1986), On the errors of misclassification based on dichotomous and normal variables, Ann. Inst. Statist, Math., 38, 529–538.
Balakrishnan, N., and Tiku, M. L.(1988), Robust classification procedures based on dichotomous and continuous variables, J. of Classif. 5, 53–80.
Cochran, W. G. (1966), Comparison of two methods of handling covariates in discriminatory analysis, Ann. Inst. Statist. Math. 16, 45–53.
Cochran, W. G. and Bliss, C. T.(1948), Discriminant function with co-variance, Ann. Math. Statist. 19, 151–176.
Krzanowski, W. J. (1975), Discrimination and classification using both binary and continuous variables, J. Amer. Statist. Assoc. 70, 782–790.
Krzanowski, W. J. (1993a), Selection of variables, and assessment of their performance, in mixed-variable discriminant analysis, Cornput. Statist. & Data Anal. 19, 419–431.
Krzanowski, W. J. (1993b), The location model for mixtures of categorical and continuous variables, J. Classif. 10, 25–49.
Krzanowski, W. J. (1994), Quadratic location discriminant functions for mixed categorical and continuous data, Stat. & Prob. Lett. 19, 91–95.
Lachenbruch, P. A. and Mickey, M. R.(1968), Estimation of error rates in discriminant analysis, Technometrics 10, 1–11.
Leung, C. Y. (1996), The location linear discriminant for classifying observations with unequal variances, Prob. & Stat. Lett. 31, 23–29.
Leung, C. Y. (1998), The covariance adjusted location linear discriminant function for classifying data with unequal dispersion matrices in different locations, Ann. Inst. Statist. Math. 50, 417–431.
McLachlan, G. J. (1992), Discriminant analysis and pattern recognition (Wiley, New York).
Memon, A. Z. and Okamoto, M. (1970), The classification statistic W* in covariate discriminant analysis, Ann. Math. Statist. 41, 1491–1499.
Searle, S. R. (1971). Linear Models, Wiley, New York.
Srivastava, M. S. and Khatri, C. G. (1979). An Introduction to Multivariate Statistics, North Holland, New York.
Author information
Authors and Affiliations
Rights and permissions
About this article
Cite this article
Leung, CY. Error rates in classification consisting of discrete and continuous variables in the presence of covariates. Statistical Papers 42, 265–273 (2001). https://doi.org/10.1007/s003620100055
Received:
Revised:
Issue Date:
DOI: https://doi.org/10.1007/s003620100055