Perception-Action Based Object Detection from Local Descriptor Combination and Reinforcement Learning
This work proposes to learn visual encodings of attention patterns that enables sequential attention for object detection in real world environments. The system embeds a saccadic decision procedure in a cascaded process where visual evidence is probed at informative image locations. It is based on the extraction of information theoretic saliency by determining informative local image descriptors that provide selected foci of interest. The local information in terms of code book vector responses and the geometric information in the shift of attention contribute to recognition states of a Markov decision process. A Q-learner performs then performs search on useful actions towards salient locations, developing a strategy of action sequences directed in state space towards the optimization of information maximization. The method is evaluated in outdoor object recognition and demonstrates efficient performance.
KeywordsObject Recognition Conditional Entropy Recognition State Sift Descriptor Attention Pattern
- 1.Bandera, C., Vico, F.J., Bravo, J.M., Harmon, M.E., Baird III, L.C.: Residual Q-learning applied to visual attention. In: International Conference on Machine Learning, pp. 20–27 (1996)Google Scholar
- 2.Deco, G.: The computational neuroscience of visual cognition: Attention, memory and reward. In: Proc. International Workshop on Attention and Performance in Computational Vision, pp. 49–58 (2004)Google Scholar
- 3.Fritz, G., Paletta, L., Bischof, H.: Object recognition using local information content. In: Proc. International Conference on Pattern Recognition, ICPR 2004, Cambridge, UK, vol. II, pp. 15–18 (2004)Google Scholar
- 4.Fritz, G., Seifert, C., Paletta, L., Bischof, H.: Rapid object recognition from discriminative regions of interest. In: Proc. National Conference on Artificial Intelligence, AAAI 2004, San Jose, CA, pp. 444–449 (2004)Google Scholar
- 10.Tipper, S.P., Grisson, S., Kessler, K.: Long-term inhibition of return of attention. Psychological Science 14, 19–25–105 (2003)Google Scholar