Abstract
Traditionally, when training supervised classifiers with Backpropagation, the training dataset is a static representation of the learning environment. The error on this training set is propagated backwards through all the layers, and the gradient of the error with respect to the classifier's parameters is used to update them. However, this process stops once the parameters connecting the input layer to the next layer have been updated. We note that there is a residual error that could be propagated further backwards to the feature vector(s) in order to adapt the representation of the input features, and that using this residual error can lead to improved speed of convergence towards a generalised solution. We present a methodology for applying this new technique to Deep Learning methods, such as Deep Neural Networks and Convolutional Neural Networks.
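To make the idea concrete, the following is a minimal sketch, not the authors' implementation, of letting the error propagate one step further back, to the input features themselves. The network architecture, learning rates, and toy data are all assumptions made purely for illustration (shown here in PyTorch).

```python
import torch

torch.manual_seed(0)

# Toy data: 8 samples, 4 input features, 3 classes.
# The input features are made learnable so the residual error can adapt them.
x = torch.randn(8, 4, requires_grad=True)
y = torch.randint(0, 3, (8,))

model = torch.nn.Sequential(
    torch.nn.Linear(4, 16),
    torch.nn.ReLU(),
    torch.nn.Linear(16, 3),
)
loss_fn = torch.nn.CrossEntropyLoss()

weight_opt = torch.optim.SGD(model.parameters(), lr=0.1)   # standard weight updates
feature_opt = torch.optim.SGD([x], lr=0.01)                # assumed feature learning rate

for epoch in range(100):
    weight_opt.zero_grad()
    feature_opt.zero_grad()
    loss = loss_fn(model(x), y)
    loss.backward()      # propagates the error all the way back to the inputs x
    weight_opt.step()    # usual parameter update
    feature_opt.step()   # additional update of the input feature representation
```

The only change from standard training is the second optimizer acting on the inputs: the gradient that would otherwise be discarded at the first layer is used to adjust the feature vectors as well.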
Notes
1. We use the notation i, j for Artificial Neural Networks, where i indicates the index of the weight in the previous layer of the network and j indicates the index of the weight in the current layer.
2. The epoch in which the validation set reaches the minimum error.
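As a purely illustrative reading of footnote 2, with a hypothetical validation-error history, the epoch of interest is simply the one with the lowest validation error:

```python
# Hypothetical validation error recorded after each training epoch.
val_errors = [0.42, 0.31, 0.27, 0.29, 0.33]

# Footnote 2: the epoch in which the validation set reaches the minimum error.
best_epoch = min(range(len(val_errors)), key=lambda e: val_errors[e])
print(best_epoch)  # -> 2 (zero-indexed)
```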
Copyright information
© 2017 Springer International Publishing AG
About this paper
Cite this paper
Mosca, A., Magoulas, G.D. (2017). Learning Input Features Representations in Deep Learning. In: Angelov, P., Gegov, A., Jayne, C., Shen, Q. (eds) Advances in Computational Intelligence Systems. Advances in Intelligent Systems and Computing, vol 513. Springer, Cham. https://doi.org/10.1007/978-3-319-46562-3_28
DOI: https://doi.org/10.1007/978-3-319-46562-3_28
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-46561-6
Online ISBN: 978-3-319-46562-3