Abstract
Breiman, Friedman, Olshen, and Stone recognized that tree classifiers would be of great value to practicing statisticians, and their CART algorithm became very popular indeed. Designing tree-based classifiers, however, has its pitfalls: it is easy to make them so simple, or so complicated, that Bayes risk consistency is compromised. In this talk, we explore the relationship between the algorithmic complexity of tree-based methods and their performance.
The author’s research was sponsored by NSERC Grant A3456.
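To make the complexity trade-off concrete, the simplest possible tree classifier is a single univariate split, in the spirit of Stoller's (1954) distribution-free discrimination rule cited below: choose the threshold that minimizes empirical error on the training sample. The sketch below (illustrative only, not code from the paper) implements that stump in plain Python; a full tree would recurse on each side of the split, and stopping too early or too late is exactly the consistency pitfall the abstract mentions.

```python
# Illustrative sketch: a Stoller-style decision stump -- the single
# univariate split minimizing empirical classification error.
# A deeper tree would apply the same search recursively on each side.

def stoller_split(xs, ys):
    """Return (threshold, left_label, right_label) minimizing training error.

    xs: 1-D features; ys: binary labels in {0, 1}.
    Points with x <= threshold receive left_label, the rest right_label.
    """
    n = len(xs)
    pairs = sorted(zip(xs, ys))
    best = None  # (errors, threshold, left_label, right_label)
    # Candidate cuts: after each sorted point, plus the empty left side.
    for cut_idx in range(n + 1):
        left = [y for _, y in pairs[:cut_idx]]
        right = [y for _, y in pairs[cut_idx:]]
        for left_label in (0, 1):
            right_label = 1 - left_label
            errors = (sum(y != left_label for y in left)
                      + sum(y != right_label for y in right))
            if best is None or errors < best[0]:
                thr = pairs[cut_idx - 1][0] if cut_idx > 0 else float("-inf")
                best = (errors, thr, left_label, right_label)
    return best[1], best[2], best[3]

# Example: two well-separated classes on the real line.
xs = [0.1, 0.3, 0.4, 1.2, 1.5, 1.9]
ys = [0, 0, 0, 1, 1, 1]
thr, left_label, right_label = stoller_split(xs, ys)  # splits at x = 0.4
```

The exhaustive search over cuts is O(n^2) here for clarity; maintaining running class counts while sweeping the sorted sample brings it to O(n log n).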
References
Amit, Y., Geman, D.: Shape quantization and recognition with randomized trees. Neural Computation 9, 1545–1588 (1997)
Biau, G., Devroye, L., Lugosi, G.: Consistency of random forests and other averaging classifiers. Journal of Machine Learning Research 9, 2015–2033 (2008)
Biau, G., Devroye, L.: On the layered nearest neighbour estimate, the bagged nearest neighbour estimate and the random forest method in regression and classification. Technical Report (2008)
Breiman, L.: Bagging predictors. Machine Learning 24, 123–140 (1996)
Breiman, L.: Arcing classifiers. The Annals of Statistics 26, 801–849 (1998)
Breiman, L.: Some infinite theory for predictor ensembles. Technical Report 577, Statistics Department, UC Berkeley (2000), http://www.stat.berkeley.edu/~breiman
Breiman, L.: Random forests. Machine Learning 45, 5–32 (2001)
Breiman, L.: Consistency for a simple model of random forests. Technical Report 670, Statistics Department, UC Berkeley (2004), http://www.stat.berkeley.edu/~breiman
Breiman, L., Friedman, J., Olshen, R., Stone, C.: Classification and Regression Trees. CRC Press, Boca Raton (1984)
Cutler, A., Zhao, G.: PERT – Perfect random tree ensembles. Computing Science and Statistics 33, 490–497 (2001)
Devroye, L., Györfi, L., Lugosi, G.: A Probabilistic Theory of Pattern Recognition. Springer, New York (1996)
Dietterich, T.G.: An experimental comparison of three methods for constructing ensembles of decision trees: bagging, boosting, and randomization. Machine Learning 40, 139–157 (2000)
Dietterich, T.G.: Ensemble methods in machine learning. In: Kittler, J., Roli, F. (eds.) MCS 2000. LNCS, vol. 1857, pp. 1–15. Springer, Heidelberg (2000)
Freund, Y., Schapire, R.: Experiments with a new boosting algorithm. In: Saitta, L. (ed.) Machine Learning: Proceedings of the 13th International Conference, pp. 148–156. Morgan Kaufmann, San Francisco (1996)
Lin, Y., Jeon, Y.: Random forests and adaptive nearest neighbors. Journal of the American Statistical Association 101, 578–590 (2006)
Nilsson, N.J.: Learning Machines: Foundations of Trainable Pattern Classifying Systems. McGraw-Hill, New York (1965)
Rosenblatt, F.: Principles of Neurodynamics: Perceptrons and the Theory of Brain Mechanisms. Spartan Books, Washington (1962)
Stoller, D.S.: Univariate two-population distribution-free discrimination. Journal of the American Statistical Association 49, 770–777 (1954)
Vapnik, V.N., Chervonenkis, A.: On the uniform convergence of relative frequencies of events to their probabilities. Theory of Probability and its Applications 16, 264–280 (1971)
© 2010 Springer-Verlag Berlin Heidelberg
Cite this paper
Devroye, L. (2010). Classification and Trees. In: Hancock, E.R., Wilson, R.C., Windeatt, T., Ulusoy, I., Escolano, F. (eds) Structural, Syntactic, and Statistical Pattern Recognition. SSPR/SPR 2010. Lecture Notes in Computer Science, vol 6218. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-14980-1_3
DOI: https://doi.org/10.1007/978-3-642-14980-1_3
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-14979-5
Online ISBN: 978-3-642-14980-1