
A Vector-Contraction Inequality for Rademacher Complexities

  • Conference paper

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 9925)

Abstract

The contraction inequality for Rademacher averages is extended to Lipschitz functions with vector-valued domains, and it is also shown that in the bounding expression the Rademacher variables can be replaced by arbitrary i.i.d. symmetric and sub-gaussian variables. Example applications are given for multi-category learning, K-means clustering, and learning-to-learn.
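
For orientation, the main inequality has the following shape (the notation below is reconstructed from the abstract and should be read as a paraphrase of the paper's statement, not a verbatim quotation): if F is a class of functions f : X → ℓ₂, each h_i : ℓ₂ → ℝ is L-Lipschitz, and x₁, …, x_n ∈ X, then

\[
\mathbb{E} \sup_{f \in F} \sum_{i=1}^{n} \varepsilon_i \, h_i(f(x_i))
\;\le\;
\sqrt{2}\, L \,
\mathbb{E} \sup_{f \in F} \sum_{i=1}^{n} \sum_{k} \varepsilon_{ik} \, f_k(x_i),
\]

where the ε_i and ε_{ik} are independent Rademacher variables and f_k(x_i) denotes the k-th coordinate of f(x_i). The classical scalar contraction inequality is recovered when f is real-valued. The abstract's second claim then allows the ε_{ik} on the right-hand side to be replaced by arbitrary i.i.d. symmetric sub-gaussian variables, up to a constant depending on their distribution, so that, for example, Gaussian variables may be used in place of Rademacher ones.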



Author information


Correspondence to Andreas Maurer.


Copyright information

© 2016 Springer International Publishing Switzerland

About this paper

Cite this paper

Maurer, A. (2016). A Vector-Contraction Inequality for Rademacher Complexities. In: Ortner, R., Simon, H., Zilles, S. (eds) Algorithmic Learning Theory. ALT 2016. Lecture Notes in Computer Science, vol 9925. Springer, Cham. https://doi.org/10.1007/978-3-319-46379-7_1

  • DOI: https://doi.org/10.1007/978-3-319-46379-7_1

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-46378-0

  • Online ISBN: 978-3-319-46379-7

  • eBook Packages: Computer Science (R0)
