Skip to main content
Log in

A Bayesian decision theory approach to variable selection for discrimination

  • Published:
Statistics and Computing Aims and scope Submit manuscript

Abstract

Motivated by examples in spectroscopy, we study variable selection for discrimination in problems with very many predictor variables. Assuming multivariate normal distributions with common variance for the predictor variables within groups, we develop a Bayesian decision theory approach that balances costs for variables against a loss due to classification errors. The approach is computationally intensive, requiring a simulation to approximate the intractable expected loss and a search, using simulated annealing, over a large space of possible subsets of variables. It is illustrated by application to a spectroscopic example with 3 groups, 100 variables, and 71 training cases, where the approach finds subsets of between 5 and 14 variables whose discriminatory power is comparable with that of linear discriminant analysis using principal components derived from the full 100 variables. We study both the evaluation of expected loss and the tuning of the simulated annealing for the example, and conclude that computational effort should be concentrated on the search.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  • Bernardo J.M. and Smith A.F.M. 1994. Bayesian Theory. John Wiley, Chichester.

    Google Scholar 

  • Brown P.J. 1993. Measurement, Regression, and Calibration. Clarendon Press, Oxford.

    Google Scholar 

  • Brown P.J., Fearn T., and Haque M.S. 1999. Discrimination with many variables. Journal of the American Statistical Association 94: 1320-1329.

    Google Scholar 

  • Brown P.J., Fearn T., and Vannucci M. 1999. The choice of variables in multivariate regression: Anon-conjugate Bayesian decision theory approach. Biometrika 86: 635-648.

    Google Scholar 

  • Dawid A.P. 1981. Some matrix-variate distribution theory: Notational considerations and a Bayesian application. Biometrika 68: 265-274.

    Google Scholar 

  • Dowsland K.A. 1995. Simulated annealing. In: Reeves C.R. (Ed.), Modern Heuristic Techniques for Combinatorial Problems. McGraw-Hill, London, pp. 377-419.

    Google Scholar 

  • Hastie T., Buja A., and Tibshirani R. 1995. Penalized discriminant analysis. Annals of Statistics 23: 73-102.

    Google Scholar 

  • Jolliffe I.T. 1986. Principal Component Analysis. Springer-Verlag, New York.

    Google Scholar 

  • Krzanowski W.J., Jonathan P., McCarthy W.V., and Thomas M.R. 1995. Discriminant analysis with singular covariance matrices: Methods and applications to spectroscopic data. Applied Statistics 44: 105-115.

    Google Scholar 

  • Lundy M. and Mees A. 1986. Convergence of an annealing algorithm. Mathematical Programming 34: 111-124.

    Google Scholar 

  • McLachlan G.J. 1992. Discriminant Analysis and Statistical Pattern Recognition. Wiley, New York.

    Google Scholar 

  • Savage L.J. 1971. Elicitation of personal probabilities and expectations. Journal of the American Statistical Association 66: 783-801.

    Google Scholar 

  • Stone M. and Jonathan P. 1994. Statistical thinking and technique for QSAR and related studies. Part II. Specific methods. Journal of Chemometrics 8: 1-20.

    Google Scholar 

Download references

Authors

Rights and permissions

Reprints and permissions

About this article

Cite this article

Fearn, T., Brown, P.J. & Besbeas, P. A Bayesian decision theory approach to variable selection for discrimination. Statistics and Computing 12, 253–260 (2002). https://doi.org/10.1023/A:1020702927247

Download citation

  • Issue Date:

  • DOI: https://doi.org/10.1023/A:1020702927247

Navigation