Abstract
One typically expects classifiers to improve in performance with increasing training set size, or at least to attain their best performance when an infinite number of training samples is at one's disposal. We demonstrate, however, that there are classification problems on which particular classifiers attain their optimum performance at a finite training set size. Whether or not this phenomenon, which we term dipping, can be observed depends on the choice of classifier in relation to the underlying class distributions. We give some simple examples, for a few classifiers, that illustrate how the dipping phenomenon can occur. Additionally, we speculate about what is generally needed for dipping to emerge. What is clear is that this kind of learning curve behavior does not emerge by mere chance and that the pattern recognition practitioner ought to take note of it.
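To make the phenomenon concrete, the minimal sketch below estimates a learning curve by Monte Carlo for a least-squares linear classifier on a one-dimensional two-class problem. The mixture is our own illustrative construction, not one of the paper's examples: class −1 hides a small, far-away mode. With few training samples that mode is rarely observed, so the fitted threshold separates the main modes well; as the sample grows, the far mode is always present and drags the asymptotic least-squares solution to a much worse boundary, so the mean test error rises with n.

```python
import numpy as np

rng = np.random.default_rng(0)

def sample(n):
    """Draw n labeled points: class +1 is N(0, 0.1^2); class -1 is a
    mixture of N(-1, 0.1^2) (w.p. 0.9) and a rare far mode N(30, 0.1^2)."""
    y = rng.choice([-1.0, 1.0], size=n)
    x = np.where(
        y > 0,
        rng.normal(0.0, 0.1, n),
        np.where(rng.random(n) < 0.9,
                 rng.normal(-1.0, 0.1, n),
                 rng.normal(30.0, 0.1, n)),
    )
    return x, y

def fit_predict(x_tr, y_tr, x_te):
    """Least-squares linear classifier: fit y ~ a*x + b, classify by sign."""
    a, b = np.polyfit(x_tr, y_tr, 1)
    return np.sign(a * x_te + b)

x_te, y_te = sample(100_000)  # large test set approximates the true error
for n in [4, 8, 16, 32, 64, 128, 512, 2048]:
    errs = [np.mean(fit_predict(*sample(n), x_te) != y_te)
            for _ in range(200)]  # average the error over 200 training sets
    print(f"n={n:5d}  mean test error = {np.mean(errs):.3f}")
```

Under this construction the curve typically bottoms out at small n (roughly 0.1–0.15 error) and climbs toward an asymptotic error of about 0.45, since the population least-squares fit orients its threshold to accommodate the far mode at the expense of the main mass of class −1: a dipping learning curve.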
Keywords
- Learning Curve
- Linear Discriminant Analysis
- Decision Boundary
- Statistical Pattern Recognition
- Training Sample Size
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
Cite this paper
Loog, M., Duin, R.P.W. (2012). The Dipping Phenomenon. In: Gimel’farb, G., et al. Structural, Syntactic, and Statistical Pattern Recognition. SSPR/SPR 2012. Lecture Notes in Computer Science, vol. 7626. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-34166-3_34
DOI: https://doi.org/10.1007/978-3-642-34166-3_34
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-34165-6
Online ISBN: 978-3-642-34166-3