Advances in Multivariate Statistical Analysis pp 233-252 | Cite as
Error Rate Estimation in Discriminant Analysis: Recent Advances
Abstract
An important problem in discriminant analysis is the estimation of the error rates associated with a given discriminant rule for allocating an object of unknown origin to one of a finite number, say g, of distinct classes or populations. The rule is based on the observed value of a random vector X of p measurements on the object. Over the years there have been many investigations on this problem; see, for example, Hills (1966), Lachenbruch and Mickey (1968), and McLachlan (1974a, b, c), and the references therein. Toussaint (1974) has compiled an extensive bibliography, which has been updated recently by Hand (1986b). An overview of error rate estimation has been given by McLachlan (1986), while recent work on robust error rate estimation has been summarized by Knoke (1986).
Keywords
Error Rate Mean Square Error Posterior Probability Allocation Rule Classification Error RatePreview
Unable to display preview. Download preview PDF.
References
- Anderson, T.W. (1951). ‘Classification by multivariate analysis’. Psychometrika 16, 31–52.MathSciNetCrossRefGoogle Scholar
- Anderson, T.W. (1984). ‘An Introduction to Multivariate Statistical Analysis’. New York: Wiley.MATHGoogle Scholar
- Basford, K.E. and McLachlan, G.J. (1985). ‘Estimation of allocation rates in a cluster analysis context’. J. Amer. Statist. Assoc. 80, 286–293.MathSciNetCrossRefGoogle Scholar
- Chatterjee, S. and Chatterjee, S. (1983). ‘Estimation of misclassification probabilities by bootstrap methods’. Commun. Statist.-Simula. Computa. 12, 645–656.CrossRefGoogle Scholar
- Chernick, M.R., Murthy, V.K. and Nealy, C.D. (1985). ‘Application of bootstrap and other resampling techniques: evaluation of classifier performance’. Pattern Recognition Letters 3, 167–178.CrossRefGoogle Scholar
- Chernick, M.R., Murthy, V.K., and Nealy, C.D. (1986a). ‘Correction note to Application of bootstrap and other resampling techniques: evaluation of classifier performance’. Pattern Recognition Letters 4, 133–142.CrossRefGoogle Scholar
- Chernick, M.R., Murthy, V.K., and Nealy, C.D. (1986b). ‘Estimation of error rate for linear discriminant functions by resampling: Non-Gaussian populations’. Unpublished manuscript.Google Scholar
- Efron, B. (1979). ‘Bootstrap methods: another look at the jackknife’. Ann. Statist. 7, 1–26.MathSciNetMATHCrossRefGoogle Scholar
- Efron, B. (1982). ‘The Jackknife, the Bootstrap, and Other Resampling Plans’. Philadelphia: SIAM.Google Scholar
- Efron, B. (1983). ‘Estimating the error rate of a prediction rule: improvement on cross-validation’. J. Amer. Statist. Assoc. 78, 316–331.MathSciNetMATHCrossRefGoogle Scholar
- Efron, B. (1986). ‘How biased is the apparent error rate of a logistic regression?’ J. Amer. Statist. Assoc. 81, 461–470.Google Scholar
- Fukunaga, K. and Kessell, D.L. (1973). ‘Nonparametric Bayes error estimation using unclassified samples’. IEEE Trans. Inf. Theory IT-19, 434–440.Google Scholar
- Ganesalingam, S. and McLachlan, G.J. (1980). ‘Error rate estimation on the basis of posterior probabilities’. Pattern Recognition 12, 405–413.MATHCrossRefGoogle Scholar
- Glick, N. (1978). ‘Additive estimators for probabilities of correct classification’. Pattern Recognition 10, 211–222.MATHCrossRefGoogle Scholar
- Gong, G. (1986). ‘Cross-validation, the jackknife, and the bootstrap: excess error estimation in forward logistic regression’. J. Amer. Statist. Assoc. 81, 108–113.CrossRefGoogle Scholar
- Hall, P. (1986). ‘Cross-validation in nonparametric density estimation’. Proc. XIIIth Int. Biometric Conference, 15pp. Seattle: Biometric Society.Google Scholar
- Hand, D.J. (1982). ‘Kernel Discriminant Analysis’. Chichester: Wiley.MATHGoogle Scholar
- Hand, D.J. (1986a). ‘Cross-validation in error rate estimation’. Proc. XIIIth Int.Biometric Conference, 15pp. Seattle: Biometric Society.Google Scholar
- Hand, D.J. (1986b). ‘Recent advances in error rate estimation’. Pattern Recognition Letters (to appear).Google Scholar
- Hills, M. (1966). ‘Allocation rules and their error rates’. J. R. Statist. Soc. B 28, 1–31.MathSciNetMATHGoogle Scholar
- Hora, S.C. and Wilcox, J.B. (1982). ‘Estimation of error rates in several-population discriminant analysis’. J. Marketing Res. 19, 57–61.CrossRefGoogle Scholar
- Knoke, J.D. (1986). ‘The robust estimation of classification error rates’. Comp. Math, with Appls. 12A, 253–260.MATHCrossRefGoogle Scholar
- Krzanowski, W.J. (1975). ‘Discrimination and classification using both binary and continuous variables’. J. Amer. Statist. Assoc. 70, 782–792.MATHCrossRefGoogle Scholar
- Lachenbruch, P.A. and Mickey, M.R. (1968). ‘Estimation of error rates in discriminant analysis’. Technometrics 10, 1–11.MathSciNetCrossRefGoogle Scholar
- Matloff, N. and Pruitt, R. (1984). ‘The asymptotic distribution of an estimator of the Bayes error rate’. Pattern Recognition Letters 2, 271–274.MATHCrossRefGoogle Scholar
- McLachlan, G.J. (1973). ‘An asymptotic expansion of the expectation of the estimated error rate indiscriminant analysis’. Austral. J. Statist. 15, 210–214.MathSciNetMATHCrossRefGoogle Scholar
- McLachlan, G.J. (1974a). ‘Estimation of the errors of misclassification on the criterion of asymptotic mean square error’. Technometrics 16, 255–260.MathSciNetMATHCrossRefGoogle Scholar
- McLachlan, G.J. (1974b). ‘The relationship in terms of asymptotic mean square error between the separate problems of estimating each of the three types of error rate of the linear discriminant function’. Technometrics 16, 569–575.MathSciNetMATHCrossRefGoogle Scholar
- McLachlan, G.J. (1974c). ‘An asymptotic unbiased technique for estimating the error rates in discriminant analysis’. Biometrics 30, 239–249.MathSciNetMATHCrossRefGoogle Scholar
- McLachlan, G.J. (1976). ‘The bias of the apparent error rate in discriminant analysis’. Biometrika 63, 239–244.MathSciNetMATHCrossRefGoogle Scholar
- McLachlan, G.J. (1977). ‘A note on the choice of a weighting function to give an efficient method for estimating the probability of misclassification’. Pattern Recognition 9, 147–149.MATHCrossRefGoogle Scholar
- McLachlan, G.J. (1980). ‘The efficiency of Efron’s bootstrap approach applied to error rate estimation in discriminant analysis’. J. Statist. Comput. Simul. 11, 273–279.MATHCrossRefGoogle Scholar
- McLachlan, G.J. (1986). ‘Assessing the performance of an allocation rule’. Comp. Maths. with Appls. 12A, 261–272.MATHCrossRefGoogle Scholar
- Page, J.T. (1985). ‘Error-rate estimation in discriminant analysis’. Technometrics 27, 189–198.MathSciNetCrossRefGoogle Scholar
- Rao, P.S.R.S. and Dorvlo, A.S. (1985). The jackknife procedure for the probabilities of misclassification’. Commun. Statist.-Simula. Computa. 14, 779–790.CrossRefGoogle Scholar
- Schwemer, G.T. and Dunn, 0.J. (1980). ‘Posterior probability estimators in classification simulations’. Commun. Statist.-Simula. Computa. B9, 133–140.MathSciNetCrossRefGoogle Scholar
- Snapinn,S.M. and Knoke, J.D. (1984). ‘Classification error rate estimators evaluated by unconditional mean squared error’. Technometrics 26, 371–378.CrossRefGoogle Scholar
- Snapinn,S.M. and Knoke, J.D. (1985). ‘An evaluation of smoothed classification error-rate estimators’. Technometrics 27, 199–206.MathSciNetGoogle Scholar
- Toussaint, G.T. (1974). ‘Bibliography on estimation of misclassification’. IEEE Trans. Inf. Theory IT-20, 472–479.Google Scholar
- Toussaint, G.T. and Sharpe, P.M. (1975), ‘An efficient method for estimating the probability of misclassification applied to a problem in medical diagnosis’. Comput. Biol. Med. 4, 269–278.CrossRefGoogle Scholar
- Tutz, G.E. (1985). ‘Smoothed additive estimators for non-error rates in multiple discriminant analysis’. Pattern Recognition 18, 151–159.MATHCrossRefGoogle Scholar
- Vlachonikolis, I.G. (1986). ‘Estimation of the expected probability of misclassification’. Comp Maths. with Appls. 12A, 187–195.MATHCrossRefGoogle Scholar
- Wang, M-C. (1986). ‘Re-sampling procedures for reducing bias of error rate estimation in multinomial classification’. Comput. Statist. Data Analysis 4, 15–39.MATHCrossRefGoogle Scholar
- Wernecke, K-D and Kalb, G. (1983). ‘Further results in estimating the classification error in discriminance analysis’. Biom. J. 25, 247–258.Google Scholar
- Wernecke, K-D, Kalb, G., and Stürzebecher, E. (1980). ‘Comparison of various procedures for estimation of the classification error in discriminance analysis’. Biom. J. 22, 639–649.MATHCrossRefGoogle Scholar