Abstract
A general model for object recognition is proposed, based on symbol-relational grammars and Bayesian networks. An object is modeled as a hierarchy of features and spatial relationships using a symbol-relational grammar. The grammar is learned automatically from examples: a simple segmentation algorithm generates the lexicon, and the elements of the lexicon serve as the terminal symbols of the grammar. This representation is then automatically transformed into a Bayesian network structure whose parameters are learned from examples, so that recognition is performed by probabilistic inference in the network. Preliminary results in modeling natural objects are presented. The main contribution of this work is a general methodology for building object recognition systems that combines the expressivity of a grammar with the robustness of probabilistic inference.
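The pipeline described in the abstract (hierarchical grammar, then Bayesian network, then inference) can be illustrated with a minimal sketch. This is not the authors' implementation: the toy grammar, the symbol names, the one-binary-node-per-symbol mapping, and the hand-set probabilities (`prior`, `p_on`, `p_off`) are all assumptions made for illustration, whereas the paper learns both the grammar and the network parameters from examples. Inference here is plain enumeration over the hidden nodes.

```python
from itertools import product

# Hypothetical toy grammar: each head symbol rewrites to its parts.
# Symbols without a production are terminals (lexicon elements or relations).
GRAMMAR = {
    "Face": ["Eyes", "Mouth", "above(Eyes,Mouth)"],
    "Eyes": ["dark_region_pair"],
    "Mouth": ["dark_horizontal_region"],
}


def grammar_to_network(grammar):
    """Map every symbol to its parent (None for the root).

    Assumption: one binary node per symbol and one edge head -> part,
    which yields a tree-shaped network for this toy grammar.
    """
    parents = {head: None for head in grammar}
    for head, parts in grammar.items():
        for part in parts:
            parents[part] = head
    return parents


def joint(assignment, parents, prior=0.5, p_on=0.9, p_off=0.2):
    """Joint probability of a full True/False assignment.

    The numbers are illustrative only: P(node | parent present) = p_on,
    P(node | parent absent) = p_off, P(root) = prior.
    """
    prob = 1.0
    for node, value in assignment.items():
        parent = parents[node]
        if parent is None:
            p_true = prior
        else:
            p_true = p_on if assignment[parent] else p_off
        prob *= p_true if value else 1.0 - p_true
    return prob


def posterior(query, evidence, parents):
    """P(query = True | evidence), summing the joint over hidden nodes."""
    hidden = [n for n in parents if n != query and n not in evidence]
    totals = {True: 0.0, False: 0.0}
    for q in (True, False):
        for values in product((True, False), repeat=len(hidden)):
            assignment = {query: q, **evidence, **dict(zip(hidden, values))}
            totals[q] += joint(assignment, parents)
    return totals[True] / (totals[True] + totals[False])


NETWORK = grammar_to_network(GRAMMAR)
ALL_SEEN = {
    "dark_region_pair": True,
    "dark_horizontal_region": True,
    "above(Eyes,Mouth)": True,
}
NONE_SEEN = {name: False for name in ALL_SEEN}

print("P(Face | all terminals observed):", posterior("Face", ALL_SEEN, NETWORK))
print("P(Face | no terminals observed): ", posterior("Face", NONE_SEEN, NETWORK))
```

Under these assumed parameters, the posterior for the root symbol is high when every terminal and the spatial relation are detected and low when none are. Enumeration is exponential in the number of hidden nodes, so a realistic network would call for a dedicated inference algorithm.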
Copyright information
© 2014 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Ruiz, E., Sucar, L.E. (2014). An Object Recognition Model Based on Visual Grammars and Bayesian Networks. In: Klette, R., Rivera, M., Satoh, S. (eds) Image and Video Technology. PSIVT 2013. Lecture Notes in Computer Science, vol 8333. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-53842-1_30
DOI: https://doi.org/10.1007/978-3-642-53842-1_30
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-53841-4
Online ISBN: 978-3-642-53842-1
eBook Packages: Computer Science (R0)