Skip to main content
Log in

Error rates in classification consisting of discrete and continuous variables in the presence of covariates

  • Notes
  • Published:
Statistical Papers Aims and scope Submit manuscript

Abstract

We consider classifying an object based on mixed continuous and discrete variables between two populations. Mixed discrete and continuous covariates with identical means in both populations are amongst the variables. Under the location model with homogeneous location specific conditional dispersion matrices for both populations, the Bayes rule is given. Classification is implemented by a plug-in version of the Bayes rule with full covariate adjustment. An asymptotic expansion of the overall expected error of the procedure is derived. Our findings generalize several classical results.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  • Anderson, T.W. (1973), An asymptotic expansion of the distribution of the studentized classification statistics W, Ann. Statist. 1, 962–972.

    Google Scholar 

  • Anderson, T. W. (1984), An introduction to multivariate statistical analysis (Wiley, New York, 2nd ed.).

    MATH  Google Scholar 

  • Balakrishnan, N., Kocherlakota, S. and Kocherlakota, K. (1986), On the errors of misclassification based on dichotomous and normal variables, Ann. Inst. Statist, Math., 38, 529–538.

    Article  MATH  MathSciNet  Google Scholar 

  • Balakrishnan, N., and Tiku, M. L.(1988), Robust classification procedures based on dichotomous and continuous variables, J. of Classif. 5, 53–80.

    Article  MATH  MathSciNet  Google Scholar 

  • Cochran, W. G. (1966), Comparison of two methods of handling covariates in discriminatory analysis, Ann. Inst. Statist. Math. 16, 45–53.

    MathSciNet  Google Scholar 

  • Cochran, W. G. and Bliss, C. T.(1948), Discriminant function with co-variance, Ann. Math. Statist. 19, 151–176.

    Article  MATH  MathSciNet  Google Scholar 

  • Krzanowski, W. J. (1975), Discrimination and classification using both binary and continuous variables, J. Amer. Statist. Assoc. 70, 782–790.

    Article  MATH  Google Scholar 

  • Krzanowski, W. J. (1993a), Selection of variables, and assessment of their performance, in mixed-variable discriminant analysis, Cornput. Statist. & Data Anal. 19, 419–431.

    Article  Google Scholar 

  • Krzanowski, W. J. (1993b), The location model for mixtures of categorical and continuous variables, J. Classif. 10, 25–49.

    Article  MATH  Google Scholar 

  • Krzanowski, W. J. (1994), Quadratic location discriminant functions for mixed categorical and continuous data, Stat. & Prob. Lett. 19, 91–95.

    Article  MATH  Google Scholar 

  • Lachenbruch, P. A. and Mickey, M. R.(1968), Estimation of error rates in discriminant analysis, Technometrics 10, 1–11.

    Article  MathSciNet  Google Scholar 

  • Leung, C. Y. (1996), The location linear discriminant for classifying observations with unequal variances, Prob. & Stat. Lett. 31, 23–29.

    Article  MATH  Google Scholar 

  • Leung, C. Y. (1998), The covariance adjusted location linear discriminant function for classifying data with unequal dispersion matrices in different locations, Ann. Inst. Statist. Math. 50, 417–431.

    Article  MATH  MathSciNet  Google Scholar 

  • McLachlan, G. J. (1992), Discriminant analysis and pattern recognition (Wiley, New York).

    Book  Google Scholar 

  • Memon, A. Z. and Okamoto, M. (1970), The classification statistic W* in covariate discriminant analysis, Ann. Math. Statist. 41, 1491–1499.

    Article  MATH  MathSciNet  Google Scholar 

  • Searle, S. R. (1971). Linear Models, Wiley, New York.

    MATH  Google Scholar 

  • Srivastava, M. S. and Khatri, C. G. (1979). An Introduction to Multivariate Statistics, North Holland, New York.

    MATH  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Rights and permissions

Reprints and permissions

About this article

Cite this article

Leung, CY. Error rates in classification consisting of discrete and continuous variables in the presence of covariates. Statistical Papers 42, 265–273 (2001). https://doi.org/10.1007/s003620100055

Download citation

  • Received:

  • Revised:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s003620100055

Key Words

Navigation