Multiple Comparison Procedures for Determining the Optimal Complexity of a Model

Galindo, Pedro L.; Pizarro-Junquera, Joaquín; Guerrero, Elisa

doi:10.1007/3-540-44522-6_82

Pedro L. Galindo⁸,
Joaquín Pizarro-Junquera⁸ &
Elisa Guerrero⁸

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 1876))

Included in the following conference series:

Joint IAPR International Workshops on Statistical Techniques in Pattern Recognition (SPR) and Structural and Syntactic Pattern Recognition (SSPR)

981 Accesses

Abstract

We aim to determine which of a set of competing models is statistically best, that is, on average. A way to define “on average” is to consider the performance of these algorithms averaged over all the training sets that might be drawn from the underlying distribution. When comparing more than two means, an ANOVA F-test tells you whether the means are significantly different, but it does not tell you which means differ from each other. A simple approach is to test each possible difference by a paired t-test. However, the probability of making at least one type I error increases with the number of tests made. Multiple comparison procedures provide different solutions. We discuss these techniques and apply the well known Bonferroni method in order to determine the optimal degree in polynomial fitting and the optimal number of hidden neurons in feedforward neural networks.

Download to read the full chapter text

Chapter PDF

An Important Equivalence Result

Application and Power of Parametric Criteria for Testing the Homogeneity of Variances. Part III

Article 01 April 2017

Confidence curves: an alternative to null hypothesis significance testing for the comparison of classifiers

Article 30 December 2016

Keywords

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

References

Bishop, C. M.: Neural Network for Pattern Recognition. Clarendon Press-Oxford (1995)
Google Scholar
Cobb, G.W.: Introduction to Design and Analysis of Experiments. Springer-Verlag New York (1998)
MATH Google Scholar
Dean, A., Voss, D.: Design and Analysis of Experiments. Springer Texts in Statistics. Springer-Verlag New York (1999)
Google Scholar
Dietterich, T.G.: Aproximate Statistical Test for Comparing Supervised Classification Learning Algorithms. Neural Computation (1998), Vol. 10, no. 7, 1895–1923
Article Google Scholar
Feelders, A., Verkooijen, W.: On the Statistical Comparison of Inductive Learning Methods. Learning from Data Artificial Intelligence and Statistics V. Springer-Verlag, New York (1996) 271–279
Google Scholar
Hsu, J.C.: Multiple Comparisons: Theory and Methods. Chapman & Hall (1996)
Google Scholar
Jobson, J.D.: Applied Multivariate Data Analysis. Springer Texts in Statistics, Vol 1. Springer-Verlag New York (1991)
Google Scholar
Stone, M.: Cross-validatory Choice and Assesment of Statistical Prediction (with discussion). Journal of the Royal Statistical Society (1974), Series B, 36, 111–147
MATH Google Scholar

Download references

Author information

Authors and Affiliations

Dpto. Lenguajes y Sistemas Informáticos Grupo ”Sistemas Inteligentes de Computación”, Universidad de Cádiz - CASEM, Polígono Río San Pedro, 11510, Puerto Real (Cadiz), Spain
Pedro L. Galindo, Joaquín Pizarro-Junquera & Elisa Guerrero

Authors

Pedro L. Galindo
View author publications
You can also search for this author in PubMed Google Scholar
Joaquín Pizarro-Junquera
View author publications
You can also search for this author in PubMed Google Scholar
Elisa Guerrero
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Computer Science, University of València, 46100, Burjassot (València), Spain
Francesc J. Ferri
Department of Computer Languages and Systems, University of Alicante, 03071, Alicante, Spain
José M. Iñesta
School of Computer Science and Engineering, University of New South Wales, Sydney, NSW, 2052, Australia
Adnan Amin
Institute of Information Theory and Automation, Academy of Sciences of the Czech Republic, 182 08, Prague 8, Czech Republic
Pavel Pudil

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Galindo, P.L., Pizarro-Junquera, J., Guerrero, E. (2000). Multiple Comparison Procedures for Determining the Optimal Complexity of a Model. In: Ferri, F.J., Iñesta, J.M., Amin, A., Pudil, P. (eds) Advances in Pattern Recognition. SSPR /SPR 2000. Lecture Notes in Computer Science, vol 1876. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-44522-6_82

Download citation

DOI: https://doi.org/10.1007/3-540-44522-6_82
Published: 21 December 2000
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-67946-2
Online ISBN: 978-3-540-44522-7
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics

Societies and partnerships

The International Association for Pattern Recognition (opens in a new tab)

Multiple Comparison Procedures for Determining the Optimal Complexity of a Model

Abstract

Chapter PDF