A result relating convex n-widths to covering numbers with some applications to neural networks

Baxter, Jonathan; Bartlett, Peter

doi:10.1007/3-540-62685-9_21

Jonathan Baxter¹ &
Peter Bartlett¹

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 1208))

Included in the following conference series:

European Conference on Computational Learning Theory

99 Accesses
1 Citations

Abstract

In general, approximating classes of functions defined over high-dimensional input spaces by linear combinations of a fixed set of basis functions or “features” is known to be hard. Typically, the worstcase error of the best basis set decays only as fast as Θ (n ^−1/d, where n is the number of basis functions and d is the input dimension. However, there are many examples of high-dimensional pattern recognition problems (such as face recognition) where linear combinations of small sets of features do solve the problem well. Hence these function classes do not suffer from the “curse of dimensionality” associated with more general classes. It is natural then, to look for characterizations of highdimensional function classes that nevertheless are approximated well by linear combinations of small sets of features. In this paper we give a general result relating the error of approximation of a function class to the covering number of its “convex core”. For one-hidden-layer neural networks, covering numbers of the class of functions computed by a single hidden node upper bound the covering numbers of the convex core. Hence, using standard results we obtain upper bounds on the approximation rate of neural network classes.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Jonathan Baxter. Learning Internal Representations. In Proceedings of the Eighth International Conference on Computational Learning Theory, Santa Cruz, California, 1995. ACM Press.
Google Scholar
Shimon Edelman and Nathan Intrator. Learning low dimensional representations of visual objects with extensive use of prior knowledge. In Sebastian Thrun, editor, Explanation-Based Neural Network Learning. Kluwer Academic, 1996. To Appear.
Google Scholar
David Haussler. Sphere packing numbers of the boolean n-cube with bounded VC-dimension. Journal of Combinatorial Theory A, 69:217–232, 1995.
Google Scholar
A N Kolmogorov. Über die beste Annäherung von Funktionen einer gegebenen Funktionenclasse. Ann. Math., 37:107–110, 1936.
Google Scholar
A N Kolmogorov and V M Tihomirov. ε-entropy and ε-capacity of sets in functional spaces. AMS Translations Series 2, 17:277–364, 1961.
Google Scholar
George G Lorentz, Manfred v. Golitschek, and Yuly Makovoz. Constructive Approximation: advanced problems. Springer Verlag, Berlin, 1996.
Google Scholar
Allan Pinkus. n-Widths in Approximation Theory. Springer-Verlag, Berlin, 1985.
Google Scholar
Sebastian Thrun and Tom M Mitchell. Learning One More Thing. Technical Report CMU-CS-94-184, CMU, 1994.
Google Scholar
Aad W van der Vaart and Jon A Wellner. Weak Convergence and Empirical Processes. Springer-Verlag, New York, 1996.
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Systems Engineering Research School of Information Sciences and Engineering, Australian National University, 0200, Canberra, Australia
Jonathan Baxter & Peter Bartlett

Authors

Jonathan Baxter
View author publications
You can also search for this author in PubMed Google Scholar
Peter Bartlett
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Shai Ben-David

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Baxter, J., Bartlett, P. (1997). A result relating convex n-widths to covering numbers with some applications to neural networks. In: Ben-David, S. (eds) Computational Learning Theory. EuroCOLT 1997. Lecture Notes in Computer Science, vol 1208. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-62685-9_21

Download citation

DOI: https://doi.org/10.1007/3-540-62685-9_21
Published: 03 June 2005
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-62685-5
Online ISBN: 978-3-540-68431-2
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics