Abstract
This paper reports an extension of the previous MIT and Caltech’s cortex-like machine vision models of Graph-Based Visual Saliency (GBVS) and Feature Hierarchy Library (FHLIB), to remedy some of the undesirable drawbacks in these early models which improve object recognition efficiency. Enhancements in three areas, a) extraction of features from the most salient region of interest (ROI) and their rearrangement in a ranked manner, rather than random extraction over the whole image as in the previous models, b) exploitation of larger patches in the C1 and S2 layers to improve spatial resolutions, c) a more versatile template matching mechanism without the need of ‘pre-storing’ physical locations of features as in previous models, have been the main contributions of the present work. The improved model is validated using 3 different types of datasets which shows an average of ~7% better recognition accuracy over the original FHLIB model.
Chapter PDF
Similar content being viewed by others
Keywords
References
Treisman, A., Gelade, G.: A feature-integration theory of attention. Cognitive Psychology 12(1), 97–136 (1980)
Itti, L., Koch, C., Niebur, E.: A model of saliency-based visual attention for rapid scene analysis. IEEE Transactions on Pattern Analysis and Machine Intelligence 20(11) (1998)
Itti, L.: Visual Attention. In: The Handbook of Brain Theory and Neural Networks, pp. 1196–1201. MIT Press, Cambridge (2003)
Riesenhuber, M., Poggio, T.: Hierarchical models of object recognition in cortex. Nature Neuroscience 2(11), 1019–1025 (1999)
Riesenhuber, M., Poggio, T.: Models of object recognition. Nature Review (2000)
Serre, T., Wolf, L., Poggio, T.: Object recognition with features inspired by visual cortex. In: CVPR (2005)
Fukushima, K., Miyake, S., Ito, T.: Neocognitron: a neural network model for a mechanism of visual pattern recognition. IEEE Transactions on Systems, Man, and Cybernetics SMC-13(3), 826–834 (1983)
Wysoski, S., Benuskova, L., Kasabov, N.: Fast and adaptive network of spiking neurons for multi-view and pattern recognition, pp. 2563–2575 (2008)
Zhang, W., Deng, H., Diettrich, G., Mortensen, N.: A Hierarchical Object Recognition System Based on Multi-scale Principal Curvature Regions. In: 18th International Conference on Pattern Recognition, ICPR 2006 (2006)
Elazary, L., Itti, I.: A Bayesian model for efficient visual search and recognition, pp. 1338–1352 (2010)
Borji, A., Itti, L.: Scene Classification with a Sparse Set of Salient Regions. In: IEEE International Conference on Robotics and Automation (ICRA), Shanghai, China (February 2011)
Harel, J., Koch, C., Perona, P.: Graph-Based Visual Saliency. MIT Press, Cambridge (2007)
Mutch, J., Lowe, D.: Object class recognition and localisation using sparse features with limited receptive fields. International Journal of Computer Vision 80(1), 45–57 (2008)
Ungerleider, L., Mishkin, M.: Two cortical visual systems. MIT Press, Cambridge (1982)
Hubel, D., Wiesel, T.: Receptive fields and functional architecture of monkey striate cortex. Journal of Physiology 195 (1967)
Tsitiridis, A., Yuen, P., Hong, K., Chen, T., Ibrahim, I., Jackman, J., James, D., Richardson, M.: An improved cortex-like neuromorphic system for target recognitions. In: Remote Sensing SPIE Europe, Toulouse (2010)
Zhang, J., Tan, T., Ma, L.: Invariant Texture Segmentation Via Circular Gabor Filters. In: 16th International Conference on Pattern Recognition, ICPR 2002 (2002)
Fei-Fei, L., Fergus, R., Perona, P.: Learning generative models from few training examples: an incremental bayesian approach tested on 101 object cagories. In: CVPR Workshop on Generative-Model Based Vision (2004)
Serre, T., Wolf, L., Bilecshi, S., Riesenhuber, M., Poggio, T.: Robust Object Recognition with Cortex-like Mechanisms. IEEE Transactions on Pattern Analysis and Machine Intelligence 29(3), 411–425 (2007)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 IFIP International Federation for Information Processing
About this paper
Cite this paper
Tsitiridis, A. et al. (2011). Enhanced Object Recognition in Cortex-Like Machine Vision. In: Iliadis, L., Maglogiannis, I., Papadopoulos, H. (eds) Artificial Intelligence Applications and Innovations. EANN AIAI 2011 2011. IFIP Advances in Information and Communication Technology, vol 364. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-23960-1_3
Download citation
DOI: https://doi.org/10.1007/978-3-642-23960-1_3
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-23959-5
Online ISBN: 978-3-642-23960-1
eBook Packages: Computer ScienceComputer Science (R0)