Abstract
In deep learning-based visual pattern recognition systems, the entire image is typically presented to the system at once. The human visual system, by contrast, scans a large visual object through sequential shifts of attention and integrates these glimpses for classification. Such sequential integration is also useful in artificial systems, particularly when the input image is too large to process whole. Previous studies based on Elman and Jordan networks have explored only fully connected layers with the full image as input, not convolutional layers operating on attention windows. To this end, we present the Convolutional Elman Jordan Neural Network (CEJNN), a novel recurrent architecture with spatiotemporal memory that integrates information from a series of small attention windows applied over the full image. Two variants of CEJNN were developed, one for reconstruction and one for classification. For both tasks, the network is trained on 48,000 images and tested on 10,000 images from the MNIST handwritten digit database. Our experiments show that the network captures the spatiotemporal correlations in the input well, achieving a mean squared error (MSE) of 0.012 on the reconstruction task and 97.62% accuracy on the classification test set.
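The chapter body is not included here, and the abstract does not specify layer sizes or update equations. As a rough illustration of the described idea, the sketch below combines a convolutional encoder over each attention window with Elman-style (hidden-state) and Jordan-style (output) feedback, so context accumulates across the scan. All names and dimensions (CEJNNSketch, window, hidden, n_out) are illustrative assumptions in PyTorch, not the authors' implementation.

```python
import torch
import torch.nn as nn


class CEJNNSketch(nn.Module):
    """Illustrative Convolutional Elman-Jordan network (not the authors' code).

    A small convolutional encoder processes each attention window in turn;
    an Elman-style loop feeds the previous hidden state back in, and a
    Jordan-style loop feeds the previous output back in, so the network
    integrates spatiotemporal information across the sequence of windows.
    """

    def __init__(self, window=8, hidden=128, n_out=10):
        super().__init__()
        self.encoder = nn.Sequential(
            nn.Conv2d(1, 16, kernel_size=3, padding=1), nn.ReLU(),
            nn.Conv2d(16, 32, kernel_size=3, padding=1), nn.ReLU(),
            nn.Flatten(),
        )
        feat = 32 * window * window          # flattened encoder output size
        # hidden state combines window features with both feedback paths
        self.hidden_fc = nn.Linear(feat + hidden + n_out, hidden)
        self.out_fc = nn.Linear(hidden, n_out)

    def forward(self, windows):
        # windows: (batch, steps, 1, W, W) -- a sequence of attention crops
        b, t = windows.shape[:2]
        h = windows.new_zeros(b, self.hidden_fc.out_features)  # Elman context
        y = windows.new_zeros(b, self.out_fc.out_features)     # Jordan context
        for i in range(t):
            feats = self.encoder(windows[:, i])
            h = torch.relu(self.hidden_fc(torch.cat([feats, h, y], dim=1)))
            y = self.out_fc(h)
        return y  # logits after integrating all windows


# Example: scan a digit as 16 non-overlapping 8x8 windows (a 28x28 MNIST
# image would need zero-padding to 32x32 for this tiling to be exact).
model = CEJNNSketch(window=8)
crops = torch.randn(16, 16, 1, 8, 8)  # batch of 16, 16 windows each
logits = model(crops)                 # -> (16, 10)
```

A reconstruction variant, as the abstract describes, would presumably replace the classification head with a decoder that emits the full image while sharing the same recurrent scanning scheme.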
Copyright information
© 2021 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Kumari, S., Aravindakshan, S., Jain, U., Srinivasa Chakravarthy, V. (2021). Convolutional Elman Jordan Neural Network for Reconstruction and Classification Using Attention Window. In: Sharma, M.K., Dhaka, V.S., Perumal, T., Dey, N., Tavares, J.M.R.S. (eds) Innovations in Computational Intelligence and Computer Vision. Advances in Intelligent Systems and Computing, vol 1189. Springer, Singapore. https://doi.org/10.1007/978-981-15-6067-5_20
DOI: https://doi.org/10.1007/978-981-15-6067-5_20
Publisher Name: Springer, Singapore
Print ISBN: 978-981-15-6066-8
Online ISBN: 978-981-15-6067-5
eBook Packages: Intelligent Technologies and Robotics (R0)