Why over-parameterization of deep neural networks does not overfit?

Zhou, Zhi-Hua

doi:10.1007/s11432-020-2885-6

Perspective
Published: 14 September 2020

Volume 64, article number 116101, (2021)
Science China Information Sciences

Zhi-Hua Zhou¹

Acknowledgements

This work was supported by the National Natural Science Foundation of China (NSFC) (Grant Nos. 61751306, 61921006). The author thanks Shen-Huan LYU and Zhi-Hao TAN for discussions and help with the figures.

Author information

Authors and Affiliations

National Key Laboratory for Novel Software Technology, Nanjing University, Nanjing, 210023, China
Zhi-Hua Zhou

Corresponding author

Correspondence to Zhi-Hua Zhou.

About this article

Cite this article

Zhou, ZH. Why over-parameterization of deep neural networks does not overfit? Sci. China Inf. Sci. 64, 116101 (2021). https://doi.org/10.1007/s11432-020-2885-6

Received: 11 April 2020
Accepted: 15 April 2020
Published: 14 September 2020
DOI: https://doi.org/10.1007/s11432-020-2885-6
