Statistical physics and practical training of soft-committee machines

The European Physical Journal B - Condensed Matter and Complex Systems

Abstract:

Equilibrium states of large layered neural networks with a differentiable activation function and a single, linear output unit are investigated using the replica formalism. The quenched free energy of a student network with a very large number of hidden units learning a rule of perfectly matching complexity is calculated analytically. The system undergoes a first-order phase transition from unspecialized to specialized student configurations at a critical size of the training set. Computer simulations of learning by stochastic gradient descent from a fixed training set demonstrate that the equilibrium results quantitatively describe the plateau states that occur in practical training procedures at sufficiently small but finite learning rates.
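The training setup described in the abstract can be illustrated with a minimal sketch: a student soft-committee machine learns from a fixed training set, labelled by a teacher of the same architecture, using stochastic gradient descent. Everything specific here is an assumption made for illustration and is not taken from the paper: the activation g(x) = erf(x/sqrt(2)), the 1/sqrt(N) scaling of the hidden-unit fields, the unnormalized sum in the output unit, the quadratic error, and the toy values of N, K, P and the learning rate. The sizes are far too small to exhibit the plateau states or the phase transition; the code only shows the mechanics of the update.

```python
import math
import random


def g(x):
    # Assumed activation: erf(x / sqrt(2)); the abstract only requires differentiability.
    return math.erf(x / math.sqrt(2.0))


def g_prime(x):
    # Derivative of erf(x / sqrt(2)) with respect to x.
    return math.sqrt(2.0 / math.pi) * math.exp(-0.5 * x * x)


def scm_output(weights, xi):
    """Soft-committee machine: K hidden units feeding one linear output unit."""
    n = len(xi)
    fields = [sum(w_i * x_i for w_i, x_i in zip(w, xi)) / math.sqrt(n)
              for w in weights]
    return sum(g(h) for h in fields), fields


def make_training_set(teacher, p, rng):
    """Draw p Gaussian inputs and label them with the teacher network."""
    n = len(teacher[0])
    data = []
    for _ in range(p):
        xi = [rng.gauss(0.0, 1.0) for _ in range(n)]
        tau, _ = scm_output(teacher, xi)
        data.append((xi, tau))
    return data


def training_error(student, data):
    """Mean quadratic error of the student on the fixed training set."""
    total = sum(0.5 * (scm_output(student, xi)[0] - tau) ** 2 for xi, tau in data)
    return total / len(data)


def sgd_train(student, data, eta, steps, rng):
    """Plain SGD: one randomly chosen example from the fixed set per step."""
    n = len(student[0])
    for _ in range(steps):
        xi, tau = rng.choice(data)
        sigma, fields = scm_output(student, xi)
        delta = sigma - tau
        for w, h in zip(student, fields):
            # Gradient of 0.5 * (sigma - tau)^2 with respect to this hidden unit's weights.
            coeff = eta * delta * g_prime(h) / math.sqrt(n)
            for i in range(n):
                w[i] -= coeff * xi[i]
    return student


rng = random.Random(0)
N, K, P = 20, 2, 100  # toy sizes, chosen only so the sketch runs quickly
teacher = [[rng.gauss(0.0, 1.0) for _ in range(N)] for _ in range(K)]
# Small random initial weights: the hidden units start out nearly identical in function.
student = [[rng.gauss(0.0, 0.1) for _ in range(N)] for _ in range(K)]
data = make_training_set(teacher, P, rng)

err_before = training_error(student, data)
sgd_train(student, data, eta=0.05, steps=5000, rng=rng)
err_after = training_error(student, data)
print(f"training error before: {err_before:.4f}, after: {err_after:.4f}")
```

To look for plateau behaviour one would record `training_error` at regular intervals during `sgd_train` and use much larger N and P than in this toy run; the small fixed learning rate mirrors the abstract's regime of "sufficiently small but finite learning rates".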

Additional information

Received 16 December 1998

About this article

Cite this article

Ahr, M., Biehl, M. & Urbanczik, R. Statistical physics and practical training of soft-committee machines. Eur. Phys. J. B 10, 583–588 (1999). https://doi.org/10.1007/s100510050889

  • DOI: https://doi.org/10.1007/s100510050889
