Neural Networks: A Statistician’s (Possible) View
Within the past few years, neural networks (NNs) have emerged as a popular, rather general-purpose means of data processing and analysis. As in most applications they are employed to perform rather standard statistical tasks like regression analysis and classification, one might wonder what is really new about them. We shed some light on this issue from a statistician’s point of view by “translating” neural network terminology into more familiar terms and then discussing some of their most important properties. Particular attention is given to “supervised” classification, i.e., discriminant analysis.
KeywordsNeural Network Hide Unit Training Pattern Empirical Risk Threshold Circuit
Unable to display preview. Download preview PDF.
- AKAIKE, H. (1973): Information theory and an extension of the maximum likelihood principle. In Petrov, B. N. and Csáki, F. (eds.), Second International Symposium on Information Theory, pp. 267–281. Budapest, Hungary: Akademiai Kiado.Google Scholar
- ANTHONY, M. (1994): Probabilistic analysis of learning in artificial neural networks: The PAC model and its variants. Tech. Rep. NC-TR-94-3, NeuroColt Technical Report Series.Google Scholar
- BARRON, A. R. (1992): Neural net approximation. In Narendra, K. (ed.), Proceedings of the 6th Yale Workshop on Adaptive Learning Systems, pp. 69–72. New Haven: Yale University.Google Scholar
- HAYKIN, S. (1994): Neural Networks: A Comprehensive Foundation. New York: Macmillan College Publishing.Google Scholar
- JUDD, J. S. (1990): Neural Network Design and the Complexity of Learning. Cambridge, MA: MIT Press.Google Scholar
- KARPINSKI, M. and MACINTYRE, A. (1994): Polynomial bounds for the VC dimension of sigmoidal neural networks. Tech. Rep. 85116-CS, University of Bonn.Google Scholar
- LIN, J.-H. and VITTER, J. S. (1991): Complexity results on learning by neural nets. Machine Learning, 6, 211–230.Google Scholar
- RIPLEY, B. (1993): Statistical aspects of neural networks. In Barndorff-Nielsen, O. E., Jensen, J. L., and Kendall, W. S. (eds.), Networks and Chaos—Statistical and Probabilistic Aspects, vol. 50 of Monographs on Statistics and Applied Probability, pp. 40–123. London: Chapman and Hall.Google Scholar
- RIPLEY, B. (1996): Pattern Recognition and Neural Networks. Cambridge University Press.Google Scholar
- STONE, M. (1974): Cross-validatory choice and assessment of statistical predictions. Journal of the Royal Statistical Society Series B, 36, 111–147.Google Scholar
- VAPNIK, V. N. (1995): The nature of statistical learning theory. New York: Springer.Google Scholar
- VAPNIK, V. N. and Chervonenkis, A. Y. (1991): The necessary and sufficient conditions for consistency of the method of empirical risk minimization. Pattern Recognition and Image Analysis, 1(3), 284–305.Google Scholar