Abstract
This paper addresses the optimization of neural network architectures. We suggest selecting the architecture whose model attains the minimal estimated average generalization error. A least-squares (LS) criterion is used for estimating neural network models, i.e., the associated model weights are estimated by minimizing the LS criterion. The quality of a particular estimated model is measured by the average generalization error, defined as the expected squared prediction error on a novel input-output sample, averaged over all possible training sets. An essential part of the suggested architecture-optimization scheme is the calculation of an estimate of the average generalization error. We suggest using the GEN-estimator [9, 10], which can handle nonlinear, incomplete models, i.e., models that are not capable of modeling the underlying nonlinear relationship perfectly. In most neural network applications it is impossible to suggest a perfect model, so the ability to handle incomplete models is essential. A concise derivation of the GEN-estimator is provided, and its qualities are demonstrated by comparative numerical studies.
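The selection scheme described above can be illustrated with a minimal sketch. The assumptions here are mine, not the paper's: the candidate models are polynomial regressors (linear in their weights, so the LS criterion has a closed-form minimizer), and a simple hold-out estimate stands in for the GEN-estimator, whose formula is not given in the abstract. Note that every candidate below is deliberately "incomplete" in the paper's sense, since no polynomial matches the sinusoidal target exactly.

```python
import numpy as np

rng = np.random.default_rng(0)

# Synthetic data from a nonlinear system: y = sin(2*pi*x) + noise.
x = rng.uniform(-1.0, 1.0, 200)
y = np.sin(2 * np.pi * x) + 0.1 * rng.standard_normal(200)

# Split into a training set (for the LS fit) and a validation set
# (hold-out stand-in for the paper's generalization-error estimator).
x_tr, y_tr = x[:120], y[:120]
x_va, y_va = x[120:], y[120:]

def design(x, degree):
    """Polynomial basis, so the model is linear in its weights."""
    return np.vander(x, degree + 1, increasing=True)

def ls_fit(x, y, degree):
    """Weights minimizing the LS criterion (closed form via lstsq)."""
    w, *_ = np.linalg.lstsq(design(x, degree), y, rcond=None)
    return w

def est_gen_error(degree):
    """Estimated generalization error: mean squared prediction
    error on held-out data."""
    w = ls_fit(x_tr, y_tr, degree)
    resid = y_va - design(x_va, degree) @ w
    return float(np.mean(resid ** 2))

# Architecture optimization: pick the candidate with minimal
# estimated generalization error.
candidates = range(1, 12)
errors = {d: est_gen_error(d) for d in candidates}
best = min(errors, key=errors.get)
print("selected model order:", best)
```

In this toy setting the low-order candidates underfit the two-period sinusoid badly, so the minimum of the estimated error picks out a substantially higher order; the paper's GEN-estimator would replace the hold-out step while the surrounding selection loop stays the same.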
References
H. Akaike, "Fitting autoregressive models for prediction," Ann. Inst. Statistical Math., 21, 243 (1969).
N. R. Draper and H. Smith, Applied Regression Analysis, John Wiley and Sons, New York (1981).
D. B. Fogel, IEEE Trans. Neural Networks, 2, No. 5, 490 (1991).
L. K. Hansen, Neural Networks, 6, 393 (1993).
L. K. Hansen and P. Salamon, IEEE Trans. Pattern Analysis Machine Intelligence, 12, No. 10, 993 (1990).
J. Hertz, A. Krogh, and R. G. Palmer, Introduction to the Theory of Neural Computation, Addison-Wesley, Redwood City, California (1991).
K. Hornik, M. Stinchcombe, and H. White, Neural Networks, 3, No. 5, 551 (1990).
R. Kannurpatti and G. W. Hart, IEEE Trans. Information Theory, 37, No. 5, 1441 (1991).
J. Larsen, in: Neural Networks for Signals, S. Y. Kung, F. Fallside, J. A. Sorensen, and C. A. Kamm (eds.), IEEE, Piscataway, New Jersey (1992), p. 29.
J. Larsen, "Design of neural network filters," Ph.D. Thesis, The Technical University of Denmark, Electronics Institute, March, 1993.
J. Moody, in: Proceedings of the First IEEE Workshop on Neural Networks for Signal Processing, B. H. Juang, S. Y. Kung, and C. A. Kamm (eds.), IEEE, Piscataway, New Jersey (1991), p. 1.
J. Moody, in: Advances in Neural Information Processing Systems 4, Proceedings of the 1991 Conference, J. E. Moody, S. J. Hanson, and R. P. Lippmann (eds.), Morgan Kaufmann Publishers, San Mateo, California (1992), p. 847.
M. Rosenblatt, Stationary Sequences and Random Fields, Birkhäuser, Boston, Massachusetts (1985).
G. A. F. Seber and C. J. Wild, Nonlinear Regression, John Wiley and Sons, New York (1989).
Additional information
The Computational Neural Network Center, Electronics Institute, The Technical University of Denmark, Building 349. Translated from Izvestiya Vysshikh Uchebnykh Zavedenii, Radiofizika, Vol. 37, No. 9, pp. 1131–1147, September, 1994.
Cite this article
Larsen, J. Optimizing neural network architectures using generalization error estimators. Radiophys Quantum Electron 37, 729–740 (1994). https://doi.org/10.1007/BF01039612