Approximation by superpositions of a sigmoidal function

Cybenko, G.

doi:10.1007/BF02551274

Approximation by superpositions of a sigmoidal function

Published: December 1989

Volume 2, pages 303–314, (1989)
Cite this article

Mathematics of Control, Signals and Systems Aims and scope Submit manuscript

G. Cybenko¹

19k Accesses
8576 Citations
54 Altmetric
4 Mentions
Explore all metrics

Abstract

In this paper we demonstrate that finite linear combinations of compositions of a fixed, univariate function and a set of affine functionals can uniformly approximate any continuous function ofn real variables with support in the unit hypercube; only mild conditions are imposed on the univariate function. Our results settle an open question about representability in the class of single hidden layer neural networks. In particular, we show that arbitrary decision regions can be arbitrarily well approximated by continuous feedforward neural networks with only a single internal, hidden layer and any continuous sigmoidal nonlinearity. The paper discusses approximation properties of other possible types of nonlinearities that might be implemented by artificial neural networks.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

On the Use of Quasi-Sigmoids in Function Approximation Problems with Neural Networks

On Sharpness of Error Bounds for Univariate Approximation by Single Hidden Layer Feedforward Neural Networks

Article Open access 01 July 2020

Constructive function approximation by neural networks with optimized activation functions and fixed weights

Article 09 June 2018

References

[A] R. B. Ash,Real Analysis and Probability, Academic Press, New York, 1972.
Google Scholar
[BH] E. Baum and D. Haussler, What size net gives valid generalization?,Neural Comput. (to appear).
[B] B. Bavarian (ed.), Special section on neural networks for systems and control,IEEE Control Systems Mag.,8 (April 1988), 3–31.
Google Scholar
[BEHW] A. Blumer, A. Ehrenfeucht, D. Haussler, and M. K. Warmuth, Classifying learnable geometric concepts with the Vapnik-Chervonenkis dimension,Proceedings of the 18th Annual ACM Symposium on Theory of Computing, Berkeley, CA, 1986, pp. 273–282.
[BST] L. Brown, B. Schreiber, and B. A. Taylor, Spectral synthesis and the Pompeiu problem,Ann. Inst. Fourier (Grenoble),23 (1973), 125–154.
MathSciNet Google Scholar
[CD] S. M. Carroll and B. W. Dickinson, Construction of neural nets using the Radon transform, preprint, 1989.
[C] G. Cybenko, Continuous Valued Neural Networks with Two Hidden Layers are Sufficient, Technical Report, Department of Computer Science, Tufts University, 1988.
[DS] P. Diaconis and M. Shahshahani, On nonlinear functions of linear combinations,SIAM J. Sci. Statist. Comput.,5 (1984), 175–191.
Article MathSciNet MATH Google Scholar
[F] K. Funahashi, On the approximate realization of continuous mappings by neural networks,Neural Networks (to appear).
[G] L. J. Griffiths (ed.), Special section on neural networks,IEEE Trans. Acoust. Speech Signal Process.,36 (1988), 1107–1190.
Google Scholar
[HSW] K. Hornik, M. Stinchcombe, and H. White, Multi-layer feedforward networks are universal approximators, preprint, 1988.
[HL1] W. Y. Huang and R. P. Lippmann, Comparisons Between Neural Net and Conventional Classifiers, Technical Report, Lincoln Laboratory, MIT, 1987.
[HL2] W. Y. Huang and R.P. Lippmann, Neural Net and Traditional Classifiers, Technical Report, Lincoln Laboratory, MIT, 1987.
[H] P. J. Huber, Projection pursuit,Ann. Statist.,13 (1985), 435–475.
MathSciNet MATH Google Scholar
[J] L. K. Jones, Constructive approximations for neural networks by sigmoidal functions, Technical Report Series, No. 7, Department of Mathematics, University of Lowell, 1988.
[K] A. N. Kolmogorov, On the representation of continuous functions of many variables by superposition of continuous functions of one variable and addition,Dokl. Akad. Nauk. SSSR,114 (1957), 953–956.
MathSciNet MATH Google Scholar
[LF] A. Lapedes and R. Farber, Nonlinear Signal Processing Using Neural Networks: Prediction and System Modeling, Technical Report, Theoretical Division, Los Alamos National Laboratory, 1987.
[L1] R. P. Lippmann, An introduction to computing with neural nets,IEEE ASSP Mag.,4 (April 1987), 4–22.
Article Google Scholar
[L2] G. G. Lorentz, The 13th problem of Hilbert, inMathematical Developments Arising from Hilbert’s Problems (F. Browder, ed.), vol. 2, pp. 419–430, American Mathematical Society, Providence, RI, 1976.
Google Scholar
[MSJ] J. Makhoul, R. Schwartz, and A. El-Jaroudi, Classification capabilities of two-layer neural nets.Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, Glasgow, 1989 (to appear).
[MP] M. Minsky and S. Papert,Perceptrons, MIT Press, Cambridge, MA, 1969.
MATH Google Scholar
[N] N. J. Nilsson,Learning Machines, McGraw-Hill, New York, 1965.
MATH Google Scholar
[P] G. Palm, On representation and approximation of nonlinear systems, Part II: Discrete systems,Biol. Cybernet.,34 (1979), 49–52.
Article MathSciNet MATH Google Scholar
[R1] W. Rudin,Real and Complex Analysis, McGraw-Hill, New York, 1966.
MATH Google Scholar
[R2] W. Rudin,Functional Analysis, McGraw-Hill, New York, 1973.
MATH Google Scholar
[RHM] D. E. Rumelhart, G. E. Hinton, and J. L. McClelland, A general framework for parallel distributed processing, inParallel Distributed Processing: Explorations in the Microstructure of Cognition (D. E. Rumelhart, J. L. McClelland, and the PDP Research Group, eds.), vol. 1, pp. 45–76, MIT Press, Cambridge, MA, 1986.
Google Scholar
[V] L. G. Valiant, A theory of the learnable,Comm. ACM,27 (1984), 1134–1142.
Article MATH Google Scholar
[WL] A Wieland and R. Leighton, Geometric analysis of neural network capabilities,Proceedings of IEEE First International Conference on Neural Networks, San Diego, CA, pp. III-385–III-392, 1987.

Download references

Author information

Authors and Affiliations

Center for Supercomputing Research and Development and Department of Electrical and Computer Engineering, University of Illinois, 61801, Urbana, Illinois, USA
G. Cybenko

Authors

G. Cybenko
View author publications
You can also search for this author in PubMed Google Scholar

Additional information

This research was supported in part by NSF Grant DCR-8619103, ONR Contract N000-86-G-0202 and DOE Grant DE-FG02-85ER25001.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Cybenko, G. Approximation by superpositions of a sigmoidal function. Math. Control Signal Systems 2, 303–314 (1989). https://doi.org/10.1007/BF02551274

Download citation

Received: 21 October 1988
Revised: 17 February 1989
Issue Date: December 1989
DOI: https://doi.org/10.1007/BF02551274

Key words

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Approximation by superpositions of a sigmoidal function

Abstract

Access this article

Similar content being viewed by others

On the Use of Quasi-Sigmoids in Function Approximation Problems with Neural Networks

On Sharpness of Error Bounds for Univariate Approximation by Single Hidden Layer Feedforward Neural Networks

Constructive function approximation by neural networks with optimized activation functions and fixed weights

References

Author information

Authors and Affiliations

Additional information

Rights and permissions

About this article

Cite this article

Key words

Navigation

Approximation by superpositions of a sigmoidal function

Abstract

Access this article

Similar content being viewed by others

On the Use of Quasi-Sigmoids in Function Approximation Problems with Neural Networks

On Sharpness of Error Bounds for Univariate Approximation by Single Hidden Layer Feedforward Neural Networks

Constructive function approximation by neural networks with optimized activation functions and fixed weights

References

Author information

Authors and Affiliations

Additional information

Rights and permissions

About this article

Cite this article

Share this article

Key words

Search

Navigation