New research reveals a duality between neural network weights and neuron activities that enables a geometric decomposition of the generalization gap, the difference between a network's performance on held-out test data and on its training data. The framework provides a way to interpret how regularization schemes such as stochastic gradient descent (SGD) and dropout affect generalization, and to improve upon these methods.
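As a minimal illustration of the quantity in question (this is not the paper's decomposition, only the raw gap it decomposes), the sketch below trains a small network with SGD and dropout on synthetic data and reports its generalization gap, here the difference between test and training mean-squared error. The architecture, data sizes, and hyperparameters are placeholder assumptions.

```python
# Minimal sketch (not from the paper): measure the generalization gap of a
# small MLP trained with SGD and dropout. All sizes and hyperparameters are
# illustrative placeholders.
import torch
import torch.nn as nn

torch.manual_seed(0)

# Synthetic regression task: targets come from a fixed random "teacher" map
# plus observation noise.
d_in, n_train, n_test = 20, 200, 2000
teacher = torch.randn(d_in, 1)

def make_split(n):
    x = torch.randn(n, d_in)
    y = x @ teacher + 0.1 * torch.randn(n, 1)
    return x, y

x_train, y_train = make_split(n_train)
x_test, y_test = make_split(n_test)

model = nn.Sequential(
    nn.Linear(d_in, 128), nn.ReLU(),
    nn.Dropout(p=0.2),  # dropout, one of the regularizers named above
    nn.Linear(128, 1),
)
opt = torch.optim.SGD(model.parameters(), lr=1e-2)  # plain SGD
loss_fn = nn.MSELoss()

for step in range(2000):
    opt.zero_grad()
    loss = loss_fn(model(x_train), y_train)
    loss.backward()
    opt.step()

model.eval()  # disable dropout for evaluation
with torch.no_grad():
    train_loss = loss_fn(model(x_train), y_train).item()
    test_loss = loss_fn(model(x_test), y_test).item()

# Generalization gap: test loss minus training loss.
print(f"train loss: {train_loss:.4f}")
print(f"test loss:  {test_loss:.4f}")
print(f"generalization gap: {test_loss - train_loss:.4f}")
```

Rerunning the sketch with the dropout layer removed, or with a different optimizer, gives a quick feel for how these regularizers shift the gap, which is the kind of effect the paper's geometric decomposition aims to explain.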