Block diagrams and splitting criteria for classification trees
Rent the article at a discountRent now
* Final gross prices may vary according to local VAT.Get Access
Various aspects of the classification tree methodology of Breiman et al., (1984) are discussed. A method of displaying classification trees, called block diagrams, is developed. Block diagrams give a clear presentation of the classification, and are useful both to point out features of the particular data set under consideration and also to highlight deficiencies in the classification method being used. Various splitting criteria are discussed; the usual Gini-Simpson criterion presents difficulties when there is a relatively large number of classes and improved splitting criteria are obtained. One particular improvement is the introduction of ‘adaptive anti-end-cut factors’ that take advantage of highly asymmetrical splits where appropriate. They use the number and mix of classes in the current node of the tree to identify whether or not it is likely to be advantageous to create a very small offspring node. A number of data sets are used as examples.
- Breiman, L., Friedman, J. H., Olshen, R. A. and Stone, C. J. (1984) Classification and Regression Trees, Wadsworth, Belmont, California.
- de Dombal, F. T., Leaper, D. J., Staniland, J. R., McCann, A. P. and Horrocks, J. C. (1972) Computer-aided diagnosis of acute abdominal pain. British Medical Journal, 2, 9–13.
- Jones, M. C. and Sibson, R. (1987) What is projection pursuit? (with Discussion). Journal of the Royal Statistical Society, 150, 1–36.
- Loh, W.-Y. and Vanichsetakul, N. (1988) Tree-structured classification via generalized discriminant analysis. Journal of the American Statistical Association, 83, 715–728.
- Lubischew, A. A. (1962) On the use of discriminant functions in taxonomy. Biometrics, 18, 455–477.
- Taylor, P. C. (1990) Classification Trees, PhD Thesis, School of Mathematical Sciences, University of Bath, UK.
- Block diagrams and splitting criteria for classification trees
Statistics and Computing
Volume 3, Issue 4 , pp 147-161
- Cover Date
- Print ISSN
- Online ISSN
- Kluwer Academic Publishers
- Additional Links
- Classification tree methodology
- Gini-Simpson criterion
- anti-end-cut factors
- mean posterior improvement criterion
- Industry Sectors